EPIC™ XE-900
1.0 GHz CPU


Need Linux, QNX, Windows®?
Try our OS EMBEDDER™ KITS 


Our kits are the shortest path to 
a successful OS on an Octagon 
embedded computer. 

* Pick your Octagon SBC 

* Pick the OS you prefer: Linux,
Windows, QNX

Octagon delivers a high 
performance, total solution. 
Need PC/104 expansion?
Try our XBLOKs


X-SRAM-2 MB

* 2 MB high speed SRAM

* Read and write at full bus speed

* Pointers to memory saved if CPU
resets or loses power





X-DIO-48 bit programmable
digital I/O 
* 48 digital I/O, 5V compatible 
* Source and sink 16 mA per output 
* Direct connection to 
opto-module racks 


X-COM-2 dual UART 

* Up to 230.4 kBaud data rate 

* Supports RS-232/422/485

* RS-485 fault protected to ±60V


X-LAN-1 Ethernet LAN
* 10/100 Base-T, Intel 82551ER


* Fully Plug-n-Play
* High performance,
PCI bus interface


XBLOKs offer the best compromise
in cost and function for both PC/104
and PC/104-Plus. Only 44% the size
of a standard PC/104 card, you can
add two functions to your system
but increase the stack height by
only one level. -40° to 85° C. Heat
diagram shows enhanced cooling.

[Diagram labels: XBLOK, XBLOK, PC/104 CPU card]


X-USB-4 quad USB 2.0
* Speeds up to 480 Mbps
* Mix and match USB 1.1 and 2.0
* Current-limited ports can supply
500 mA to external devices


Need a fanless system? 
NEW CONDUCTION 
COOLING SYSTEM 


Designed for the XE-900,
our conduction cooling system
eliminates a fan even at 1.0 GHz.

For a full listing of
Octagon Systems
products, visit us at
www.octagonsystems.com
SBS knows Modular Computing. Careful design yields speed & thermal efficiency.

BY CLEVERLY INTEGRATING best-of-breed silicon, SBS Technologies® has
created a processor AdvancedMC™ that fully exploits the potential of this
new form factor. Think of it as AdvancedMC Extreme.

There's extreme bandwidth: parallel PCI Express x8 lanes, dual Gigabit
Ethernet links, and two SATA channels to the carrier. Extreme processing
power: the latest 2 GHz embedded Intel® Pentium® M processor with 2 MB
integrated L2 cache and a 533 MHz front side bus. Extreme silicon: DDR2
400 MHz SDRAM, the Intel® E7520 chipset with 533 MHz system bus to the
processor, plus one PCI Express x4 bus with 2 Gbytes/s transfer rate to
Ethernet and another PCI Express x8 bus with 4 Gbytes/s transfer rate to
the AdvancedMC interconnect.

The new Telum ASLP10 processor AdvancedMC from SBS Technologies.
Beautiful, isn’t it?

SBS Technologies.

Find the AdvancedMC board you’re looking for at www.sbs.com or call 800.SBS.EMBEDDED
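
As a rough cross-check of the PCI Express transfer rates quoted in the ad above, the arithmetic can be sketched as follows. The ad does not say how its figures were derived; this sketch assumes first-generation PCI Express signaling (2.5 GT/s per lane with 8b/10b encoding) and assumes the quoted numbers count both directions of the link. The function name is illustrative only.

```python
# Assumptions (not stated in the ad): first-generation PCI Express,
# 2.5 GT/s per lane, 8b/10b line encoding (8 payload bits per 10 line bits).
GEN1_TRANSFERS_PER_SEC = 2_500_000_000
PAYLOAD_BITS, LINE_BITS = 8, 10


def pcie_gen1_bytes_per_sec(lanes: int, both_directions: bool = True) -> int:
    """Return usable link bandwidth in bytes per second for a Gen1 link."""
    # Payload bits per second on one lane, one direction.
    payload_bits = GEN1_TRANSFERS_PER_SEC * PAYLOAD_BITS // LINE_BITS
    per_direction = lanes * payload_bits // 8  # bits -> bytes
    return per_direction * 2 if both_directions else per_direction


if __name__ == "__main__":
    for lanes in (4, 8):
        gbytes = pcie_gen1_bytes_per_sec(lanes) / 1e9
        print(f"x{lanes}: {gbytes} Gbytes/s (both directions combined)")
```

Under these assumptions an x4 link works out to 2 Gbytes/s and an x8 link to 4 Gbytes/s, which matches the figures in the ad; per direction, each is half that.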
A major trend in high-performance, real-time
data acquisition boards and subsystems
is the move toward more flexible
interfaces. For example, the 400 Mbyte/s
StreamStor Amazon SATA Disk Controller
board (bottom) from Conduant uses a
modular, mezzanine approach to external
interfaces for direct-to-disk recording:
optional interchangeable daughtercards
(top) for interfaces such as FPDP, Serial
FPDP, FPDP II or the PCI bus. Pg. 14
System-tracing tools can be used to optimize
performance in a multicore system.
Pg. 46

Serial RapidIO Distributed Switch Solution
for VME Systems. Pg. 66
Operate and survive under the most extreme
conditions with ruggedized E-Disk®
solid-state flash drives and
network storage solutions.
BiTMICRO’s cutting-edge
storage technologies offer utmost
reliability, optimum data security and
unmatched performance.

Ethernet | Fibre Channel | SCSI | IDE/ATA
USB | FireWire | cPCI | VME | SATA | iSCSI
PCI-X | PCI Express | SAS | Infiniband

BiTMICRO Networks, Inc.
ULTIMATE STORAGE SOLUTIONS™
45550 Northport Loop E, Fremont, CA 94538-6481
www.bitmicro.com | info@bitmicro.com | 510-743-3475











MISSION CRITICAL...
data storage modules

Extreme Comprehensiveness: We offer the most comprehensive VME/cPCI
storage product line in the world, offering device alternatives for
any standard or unique application.
* Solid State Disk
* Removable Hard Disk
* Tape Drives
* Optical Disk
* PCMCIA Adapter

Extreme Performance: Our VME products feature extreme speed, capacity and
rugged reliability with 320 MB/sec throughput enabled by
LVD SCSI technology, storage capacity of more than 600 GB
per module and a 1,400,000 hour MTBF.

Extreme Quality: Phoenix International is the only
manufacturer of VME data storage products that is
ISO 9001:2000 Certified.

PHOENIX INTERNATIONAL

Phoenix International Systems, Inc. An ISO 9001:2000 Certified SDVOSB
714-283-4800 | 800-203-4800 | www.phenxint.com
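
To put the 1,400,000 hour MTBF figure quoted in the ad above in perspective, it can be converted into an approximate chance of failure over a year of continuous operation. This is a sketch under a common textbook assumption that the ad itself does not state: a constant failure rate (exponential lifetime model). The function name is illustrative.

```python
import math

HOURS_PER_YEAR = 24 * 365  # continuous operation, non-leap year


def annual_failure_probability(mtbf_hours: float) -> float:
    """Probability of at least one failure in a year of continuous use.

    Assumes a constant failure rate of 1/MTBF (exponential model),
    which is an assumption of this sketch, not a claim from the ad.
    """
    return 1.0 - math.exp(-HOURS_PER_YEAR / mtbf_hours)


if __name__ == "__main__":
    p = annual_failure_probability(1_400_000)
    print(f"Approximate annual failure probability: {p:.2%}")
```

Under that model, an MTBF of 1.4 million hours corresponds to well under a one percent chance of failure per module per year; it does not mean a single module is expected to run for 160 years.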
multi-core processors”

Dan Dodge. QNX CEO & CTO.
Pioneer in distributed and multiprocessor computing.

Introducing the QNX® Momentics® development suite
Multi-Core Edition, the industry’s most comprehensive
software platform for multi-core systems. Powered by the
massively scalable QNX Neutrino® RTOS, this fully integrated
solution supports AMP, SMP, and BMP, a groundbreaking
technology that simplifies code migration and future-proofs
your designs for quad-core and beyond. It’s the latest
innovation from QNX Software Systems, the undisputed
leader in multiprocessing technology.

QNX Unlocks the Power of Multi-Core

Maximize performance. Eliminate complexity.
Accelerate migration. Only QNX offers:

* Asymmetric Multiprocessing (AMP) for full
developer control and fault tolerance
* Symmetric Multiprocessing (SMP) for maximum
concurrency and scalability
* Bound Multiprocessing (BMP) for the fastest
code migration and minimum design complexity
* Transparent Inter-Processor Communication
(TIPC) protocol for seamless Linux connectivity
* System tracing tools for fast debugging and
optimization of multi-core applications
* Off-the-shelf BSPs for multi-core platforms based
on MIPS®, PowerPC®, and x86 architectures

Maximum Choice for Multi-Core

Only QNX gives you the power to choose the best multiprocessing
model for your multi-core design:

[Comparison table; column headings and some cells not legible in source]
Scalable beyond dual-core: Limited (one column; other cells not legible)
Dedicated processor by function: - | ✓ | ✓
Inter-core messaging: Fast (OS primitives) | Fast (OS primitives) | Slower (application)
Thread synchronization between cores: ✓ | ✓ | -
Dynamic load balancing: ✓ | ✓ | -
System-wide debug & optimization: ✓ | ✓ | -
Mixed OS environment: - (other cells not legible)

Discover how Dan and the QNX team deliver the shortest
migration path to multi-core. Call 1 800 676 0566 or
visit www.qnx.com/innovate.

QNX SOFTWARE SYSTEMS

QNX, Momentics, and Neutrino are trademarks or registered trademarks of QNX Software Systems GmbH & Co. KG and are used under
license by QNX Software Systems International Corporation. All other trademarks belong to their respective owners. 301786 MC339.05
you need at info@vadatech.com.
Share your vision with us. We’ll customize
our products to your requirements or partner
with you to develop custom
products all the way
through deployment.
Either way, you'll
get leading-edge
board level

* Comprehensive Hardware/Software
solution for Intelligent Platform
Management Interface (IPMI) version 2.0
* Symmetric Multi-Processing CPU for
AMC/VME Modules
* A complete line of ATCA
and AMC carriers
* High dynamic range A/D Converters

vadatech

THE POWER OF VISION

www.vadatech.com 702.896.3337
Editorial
January 2006 


Penguins on the Ice 


by Tom Williams, Editor-in-Chief 


I’ll say up front that I have not seen the movie, March of
the Penguins, but I have seen one image of penguin behavior
that seems indicative of where one part of our industry stands.
That’s where a bunch of penguins are crowded up at the edge of
the ice trying to decide whether it’s safe to go into the water. If
there are sharks or orcas down there, of course, everybody will
take a pass. What usually happens is that there is such a crush that
one penguin eventually gets pushed over the edge. If he doesn’t
get eaten, the others decide it’s OK to get into the water. The
water, of course, is where the food is. The question is simply who is
going to be the food.

This seems to be where the industry is at this moment in
terms of ATCA and MicroTCA. A number of CEOs I have talked
to recently keep saying things like, “We see an enormous
potential,” and, “Our customers are taking a close look and testing the
market.” A number have even jumped into the water, and while
they haven’t exactly been gorging themselves, they haven’t been
eaten either. The last telecom debacle was traumatic and there is
still a good deal of bad-mouthing of telecom going on in some
quarters today. But is the outlook really as discouraging as some
would have us believe?

For one thing, let us remember once again that what we
refer to as “telecom” today is a different animal than it was in the
past. The telecom of the future is a fully digital, IP packet-based
broadband communications network whose infrastructure is still
in transition from the old POTS world of the late Ma Bell. The
demands that will be placed on this network would have completely
overwhelmed the plain old telephone system of yesteryear.

The recent Consumer Electronics Show in Las Vegas may
be one indication of what is starting to pull at the telecom
industry. There the talk was of high-definition video and
multimedia, wireless connectivity, movies on demand and pervasive
connectivity. Products are starting to emerge that will depend on
the existence of pervasive broadband networking. The consumer
market, however, is only one indicator. The needs of the
financial sector will drive the build-out of the infrastructure as will
advances in medical imaging equipment. The latter can produce
3D layered images of minute detail in color and some in real
time. There is a growing need to share these expensive machines
among remote experts who will need access to their data.
Industrial companies will increasingly need to distribute design data
and organize teleconferences.

The specialized nature of many of these applications will push 
the value-add from merely supplying high-speed pipes to provid- 
ing services closer to the application in the form of middleware or 
protocol conversion or lower-volume, more specialized hardware. 
That makes it attractive to build out the bulk of the basic infra- 
structure using open standards hardware. Of course, the coun- 
ter argument to this always is that such hardware will be driven 
to commodity status where only huge volumes will be profit- 
able, putting it in the hands of only a few very big players—if 
it is successful at all. Commoditization will surely happen. The 
question is whether in such a potentially large market there is 
room for makers of specialty hardware in the same form-factor or 
for makers of configuration modules (e.g., AMC and MicroTCA)
that will, of necessity, sell at higher margins but fit into the com- 
modity backplanes and carrier boards. 

Everybody takes glee in dissing Windows, but Microsoft’s 
near monopoly has provided fertile ground for a huge number 
of successful companies, spreading wealth in all directions. By 
the same token, if there is a base infrastructure founded on the 
products of a relatively few large commodity suppliers, it is cer- 
tainly possible that more specialized suppliers of software and 
hardware will be able to build on such a foundation. 

Instead of wringing our hands about whether the new 
“telecom” industry will take off, perhaps we should think about 
where to attach ourselves when it does. The water is full of fish 
and not all of them are appetizing to orcas. There will be many 
specialized needs that can be served by a brand new digital 
telecom infrastructure and it will require imagination and daring 
to identify and serve them, which will demand new hardware and 
software that can play in the arena. If you are the penguin on the 
ice that gets pushed in first, there is a risk you might be eaten, but 
there is also a very good chance you'll get first choice of the best 
fish. Then again there are some companies (if not penguins) who 
jump in with the sole intention of being eventually swallowed. 
That doesn’t have to be a bad thing either.
unless you have a long-term commitment
from your SBC vendor. VersaLogic guarantees product availability for at least five years
from its introduction date. That saves you from tossing future sales into the jaws of competitors
because you suddenly cannot get your hands on the boards you need, when you need them.
VersaLogic’s Extended Life-Cycle Policy is just one of the services that helped
us earn the prestigious VDC Platinum Vendor award. From our close ties with key
vendors to our pre-design component studies, we make it our business to extend
your product’s life. So sink your teeth into the details at www.VersaLogic.com/

Before Kristin Koons joined
PICMG Responds to AMC Editorial


While your November editorial regarding the
AMC specification (“Flurries Around AMC Appear
to be Letting Up”) is pretty much correct
factually, there’s a tone suggesting a narrowly
averted disaster and a continuing threat to the
specification. I don’t think that anyone directly
involved in the process shares your view. What
is happening to AMC is normal. The first major
version of the spec has been released, and
the first wave of refinements is being proposed
through the normal engineering change request
process. AdvancedTCA is moving through this
process as CompactPCI did before it.

Take the connector situation, for instance. 
Your piece says that the variety of carrier con- 
nectors that can be used has been broad- 
ened, but it is more precise to say that a 
wider variety of connectors that are explicitly 
compliant with the spec has been identified. 
All of the connector types that are being ex- 
plicitly added can be used to make perfectly 
serviceable AMC carriers whether or not they 
make it into the spec. Contrary to the implica- 
tion of your editorial, the compression mount 
connector from the first version has not been 
removed from the specification. 

As for other change requests that might
have threatened backward compatibility, I don’t
believe that there was ever a serious possibility
that they would have been incorporated.
The issues that they are intended to address
have been carefully examined, and are being
resolved in a way that preserves backward
compatibility.

The implication that some variations on
MicroTCA might be threatened by IP is also
overblown. Any IP that is incorporated in the
MicroTCA specification will be available under
reasonable and non-discriminatory licensing
terms in accordance with PICMG policy, a policy
that is essentially identical to all other major
standards and specification development
organizations.

While PICMG normally does not comment
on the activities of other standards/specification
development organizations (and would appreciate
reciprocity from them), we have taken
note of the VITA 56 activity and the many ways
in which it draws upon the pioneering work of
AMC. This is not the first time VITA standards
have drawn on the work of PICMG (witness the
adaptations of PICMG 2.9 and 2.16 to the VME
environment), nor would we wish it to be the
last. We believe that imitation is the sincerest
form of flattery. It is true that VITA 56 is
targeted for use in VME, VXS and VPX systems,
and is therefore smaller in its depth dimension.
The power distribution strategy is different as
well. It does not necessarily follow, however,
that VITA 56 is targeted at a different market
than AMC and MicroTCA. I don’t believe that either
the AMC or VITA 56 communities are ready
to concede applications spaces to the other.
That’s something to watch!

In closing, I’d like to thank you for seeking
balance between the voices of alarm and those
of reason. I think you made it most of the way
in terms of the facts reported in your editorial.
As I’ve said, I think the tone still needs a bit of
adjustment and I’ve tried to provide what I think
is the proper spin on what is happening.


Dick Somes 
PICMG Technical Officer 


Late-Breaking Results of Shock and Vibe Tests In 


Tom: 

When we talked a couple of months ago 
about the status of the changes to the AMC 
specification, the shock and vibration tests 
were not complete. They are now, and I’m happy 
to report that all shock, vibration and seismic 
tests passed with flying colors. The test re- 
port can be accessed at the PICMG Web site: 
www.picmg.org. 


The worst thing that happened was a couple
of screws in the side of one of the test chassis
came loose, but that was unrelated to the
AMC or its carrier.

So, we don’t have a mechanical problem
and will not be doing any mechanical re-design.


Cheers, 
Joe Pavlat, President, PICMG 





Radio Resource 
Management Spec for IEEE 
802 Wireless LAN Passes 
Milestone 

The IEEE 802.11 Working Group 
has passed a major milestone in 
the development of IEEE 802.11k, 
“Wireless LAN Medium Access 
Control (MAC) and Physical Layer 
(PHY) Specifications: Radio 
Resource Management of Wireless 
LANs,” by voting to accept a 
draft radio resource measurement 
document as a baseline for the final 
standard. 

Once completed, IEEE 802.11k 
will allow enhanced measurements 
and diagnostics for IEEE 802.11 
wireless local area networks 
(WLANs) that operate in the 
unlicensed 2.4 GHz (ISM), 4.9 
GHz (Japan) and 5 GHz (UNII) 
bands. This amendment to 
the IEEE 802.11 base standard 
will enable more accurate and 
efficient operation of WLANs 
in governmental, residential,
enterprise and metropolitan 
settings. 

“Next generation video streaming, 
wireless VOIP and dense WLAN 
deployments present new challenges 
that call for more precise WLAN 
measurements,” says Stuart Kerry,
IEEE 802.11 Working Group
Chair. “IEEE 802.11k will help
optimize these radio environments
so more devices can coexist even as
it reduces wireless network traffic 
congestion. Final approval of this 
amendment is targeted for January 
2007.” 


Brooktrout and tekVizion 
Announce Strategic 
Partnership 

Brooktrout Technology and 
tekVizion PVS have announced 
that Brooktrout has become a 
Premier Tenant in tekVizion 
Labs, the company’s pioneering 
third-party testing, qualification 
and interoperability facility for 
carrier-grade IP applications and 
infrastructure. Premier Tenant 




The Power of 4


The New Virtex-4 range from Nallatech 


BenNUEY-PCI-X 








>> PCI-X DIME-II Motherboard 

>> On-board Xilinx Virtex-4 FX user FPGA 
>> 64 bit/133 MHz PCI-X Interface

>> 3 DIME-II module expansion slots 

>> 18 Mbytes DDR-II SRAM 

>> Available in Windows and Linux 


>> DIME-II Expansion Module 

>> Dual on-board Xilinx Virtex-4 LX 
User FPGAs 

>> Up to 300+ logic cells per module 

>> 64 Mbytes DDR-II SRAM — 8 BANKS 

>> 16 Gbytes/sec total SRAM memory
bandwidth 

>> Includes DDR-II SRAM FPGA IP Core 


>> DIME-II Expansion Module 

>> On-board Xilinx Virtex-4 LX or SX 
User FPGA 

>> Quad 12-bit 250 MSPS analog 
capture channels 

>> 16 Mbytes DDR-II SRAM — 2 BANKS 

>> Includes DDR-II SRAM FPGA IP Core 


>> DIME-II Expansion Module 
>> On-board Xilinx Virtex-4 LX or SX 
User FPGA 
>> 1 Gbyte DDR2 SDRAM - 2 BANKS
>> 16 Mbytes DDR-II SRAM — 2 BANKS 
>> 8 Gbytes/sec total memory bandwidth 
>> Includes SRAM and SDRAM FPGA IP Cores 


To find out more please visit - www.nallatech.com/virtex4 


NALLATECH

The High Performance FPGA Solutions Company

North America Toll free: 1-877-44-NALLA
EMEA & ROW Phone: +44 (0) 1236 789500
Event Calendar
02/06-09/06 


Components for Military & 
Space Elec. Conference 
Los Angeles, CA 
www.cit-uS.com 


02/06-09/06 
DesignCon 2006 
Santa Clara, CA 
www.designcon.com 


02/15-17/06 

AUSA Winter Symposium 
and Exhibition 

Ft. Lauderdale, FL 
www.ausa.org 


02/22-23/06 
AFCEA Homeland 
Security Conference 
Washington, DC 
www.afcea.org 


02/28/06 

Real-Time & Embedded 
Computing Conference 
Melbourne, FL 
www.rtecc.com/melbourne 


03/02/06 

Real-Time & Embedded 
Computing Conference 
Huntsville, AL 
www.rtecc.com/huntsville 


03/05-10/06 
Optical Fiber Comm. 
Conference & Expo 
Anaheim, CA 
www.ofcnfoec.org 


03/07-09/06 
Intel Developer Forum 
San Francisco, CA 
www.intel.com/idf


03/14-15/06 

Military Technologies Conf. 
Boston, MA 
www.miltechconference.com 


03/20-23/06 

Nat’l Manufacturing Week
Chicago, IL 
www.manufacturingweek.com 


If your company produces any type 
of industry event, you can get your 
event listed by contacting 


sallyo@rtcgroup.com. 


This is a FREE industry-wide listing. 


status allows Brooktrout’s 
customers and partners to test 
Brooktrout’s SnowShore IP Media 
Server with a variety of other IMS 
components in tekVizion Labs 
prior to deployment. 

tekVizion helps service
providers achieve a smooth
transition to packet-voice
networks, speeding delivery
of new integrated services.
tekVizion Labs offers several
services including tekVizion
and third-party certification
testing, remote testing, solution
testing, IMS conformance and
interoperability testing,
product/compliance assessments and
outsourced testing. Remote testing
can be used for pre-certification
or ad-hoc test scenarios, whereas
formal tekVizion certification is
conducted on-site at tekVizion
Labs by tekVizion engineers.
Certification can be provided
between two products or in a
multi-product solution. All lab
services are available to both
vendors and service providers.

Brooktrout’s SnowShore IP
Media Server is a software-based,
carrier-grade IP media
server supported by a wide
range of industry standard
hardware platforms running on
Red Hat Linux. It leverages SIP,
VoiceXML and MSCML to
provide a cost-effective and
scalable IP media server solution,
powering a broad range of voice
and video services for
next-generation wireline, wireless and
broadband networks including
the 3GPP/3GPP2 IP Multimedia
Subsystem (IMS) network
architecture.


INDUSTRY INSIDER


QuickLogic Signs OEM
Agreement with Mentor
Graphics on FPGA Design

Mentor Graphics has announced a multi-year OEM agreement with
QuickLogic. The agreement offers QuickLogic’s customers a new synthesis
solution, and provides for a smooth transition to the full range of
advanced FPGA synthesis technology and tools from Mentor Graphics.
Through the agreement, FPGA designers get immediate access to a broad
range of performance and productivity benefits using the Mentor Graphics
Precision Synthesis tool within the QuickLogic QuickWorks FPGA
development software environment. Using a highly interactive graphical
environment, designers gain the flexibility to cross-probe between HDL
and schematic views. The tool performs “what if” timing analysis with
instant feedback, enabling designers to make confident decisions for
fast and accurate timing closure.

Precision Synthesis is scheduled to be fully integrated into the
QuickWorks FPGA development software in early 2006. The new Precision
Synthesis QuickLogic Edition supports QuickLogic’s leading programmable
logic devices, including Eclipse, Eclipse II, QuickMIPS and QuickPCI.
Beta support is also available for the company’s newly announced
PolarPro devices. QuickWorks supports Windows, Sun Solaris and
Linux-based operating systems and provides a design environment ranging
from schematic and HDL-based design entry, HDL language editors and
tutorials, logic synthesis, place and route, timing analysis and
simulation support. QuickLogic has partnered with leading software
vendors to provide industry-leading Synthesis and Simulation tools, as
well as provide an interface to other industry standard EDA tools.


Curtiss-Wright Earns
AS9100 Aerospace Quality
Systems Certification

Curtiss-Wright Controls Embedded Computing has been awarded AS9100
accreditation for its Leesburg, Virginia and Ottawa, Ontario operations.
AS9100 is overseen by the International Aerospace Quality Group and
standardizes quality management system requirements and delivers quality
assurance in design, development, production, installation and
servicing. The standard also drives cost reductions throughout the
aerospace industry supply chain. Curtiss-Wright’s accreditation program
was directed by Gerry Bellehumeur, Quality Director, Curtiss-Wright
Controls Embedded Computing, Ottawa. The AS9100 audit was performed by
TUV America.

Curtiss-Wright has announced that it has developed plans to implement
the AS9100 standard at additional operational sites throughout the
United States and the United Kingdom. The benefits of AS9100
accreditation for Curtiss-Wright include expanded market access as a
result of a standardized process system recognized across the aerospace
industry. Because AS9100 drives ongoing improvements in products and
processes, it reduces errors and returns and increases customer
satisfaction, resulting in reduced transaction costs.

AS9100 compliance exceeds the ISO 9001:2000 quality standard on which
it’s based, with additional quality system requirements such as
independent validation of materials and processes. It adds approximately
80 additional requirements and 18 amplifications to the ISO 9001:2000
Standard. The standard addresses the unique, complex and highly
regulated nature of the defense aerospace industry.


Is Limited Storage Becoming a Problem? 
Introducing AdvancedTCA Storage Platforms from DTI


To Learn more about AdvancedTCA (ATCA) and DTI’s Products based on this standard 


Call 1.800.443.2667 or visit our website at www.atcatogo.com 


DTI’s Targa-14E system 
provides a complete 
ATCA platform with 
increased performance 
and storage capabilities. 


Data Acquisition
Subsystems Getting Faster, 
Interface-Agnostic 


More and more embedded applications rely on next-generation, 
high-performance data acquisition subsystems to gather large 
quantities of data at high speeds and convert it into usable form. 
The use of switched fabrics, FPGAs and network topologies, along 
with a move toward flexible interfaces and vendor interoperability, 
are helping designers of these subsystems keep pace. 


by Ann R. Thryft 
Senior Editor 


As high-performance embedded computing
systems continue to achieve
new levels of functionality and performance,
larger amounts of data must be
gathered, processed and analyzed.
Applications such as radar data acquisition
and video-centric imaging, for situational
awareness on the battlefield or high-speed
manufacturing lines, have vastly increased
the amount of data and the speed at which
it must be processed.

For example, next-generation radar 
data acquisition systems are being built 
for advanced aircraft such as Northrop 
Grumman’s E-2D Hawkeye. These will 
help the U.S. Navy’s Sea Strike offen- 
sive capabilities by increasing battlespace 
awareness, providing theater air missile 
defense capabilities, improving detection 
and tracking, and narrowing the link be- 
tween sensor and shooter for more agile 
response to time-sensitive targets. 





A major trend in high-performance, real-time data acquisition boards
and subsystems is the move toward more flexible interfaces. 
For example, the 400 Mbyte/s StreamStor Amazon SATA Disk 
Controller board (bottom) from Conduant uses a modular, mezzanine 
approach to external interfaces for direct-to-disk recording: optional 
interchangeable daughtercards (top) for interfaces such as FPDP, 
Serial FPDP, FPDP II or the PCI bus. 
Your new IMS infrastructure applications using Kontron
AdvancedTCA / AdvancedMC modular solutions.

Kontron simply takes the worry - and the expense - out of building complex IMS
communication platforms for fixed-mobile networks. Whatever the application,
you can design and deploy your project faster than you think with fully integrated,
open standard modular solutions that are application-ready, right off the shelf.

That means reduced development costs for you, and tremendous "swap in-swap out"
service flexibility for your carrier customers. It's a very smart win-win go-to-market
strategy for everything from data and signaling platforms to IP streaming multimedia
applications for video-on-demand, real-time VoIP and video telephony.

It's so simple. Start your next application with Kontron.
DEVELOPMENT SOLUTION

Arium offers robust JTAG emulation
and development tools for today's
embedded software engineers using
targets with ARM7™/ARM9™/ARM11™,
Intel XScale®, and TI OMAP™ cores and
Intel® Pentium® processor families.

* Full symbolic, source-level Linux kernel
debug and source-level process debug;
seamless debug between them! No
other vendor offers this powerful
feature at any price.


Manufacturers of data acquisition
boards and subsystems are leveraging
existing hardware off-the-shelf building
blocks, such as intelligent I/O controllers,
embedded switched fabric interconnect
and high-speed fiber interface PMC
modules, and adding existing software to
keep costs and risk down. Combined with
the customization capabilities of onboard
FPGA-based IP, these components are
producing subsystems that meet aggressive
size, weight and power constraints
while also delivering high performance.

Meanwhile, real-time video imaging
has arrived on the data collecting stage
in a big way. Video is playing an increasingly
central role in capturing additional
data and monitoring operations. On the
factory floor as well as on the battlefield,
data must be processed and fused in real
time and made available simultaneously
to all of the system’s nodes. Here, the
challenge for design engineers is to maintain
the low latency required for real-time
data capture and distribution, in addition
to the high I/O throughput rates needed to
handle large video streams and preserve
data accuracy.

Distributed shared memory network
architectures based on a ring topology are
being utilized to construct high-speed I/O
networks that not only meet these needs,
but also enable remote processing far from
the harsh environments where the gathering
of data occurs. Data is captured at
multiple stations and sent to several processors,
each of which processes different
pieces of that data simultaneously.

On another front, a major breakthrough
has occurred in the IF data
transfer interface. The emerging VITA
49 standard defines a standard way of
transferring IF data in a digital, link-agnostic
format between analog front-ends
and DSP subsystems. Instead of depending
on application- and/or equipment-specific
interfaces to transmit digitized
incoming analog signal data to system
elements, the new format defines a data
structure for the transmission of digital
IF data between multiple sources and
destinations for both receive and transmit
paths. The standard’s methodology
for representing digital IF data can be
layered on top of any transport protocol
or physical communications link.

The ramifications are clear: OEMs
will no longer be tied to vendor-specific
interfaces, but can select interoperable
system and subsystem components from
many vendors based on which ones best
fit their applications. As an added bonus,
designers no longer must rework system
hardware and software each time a source
or destination component changes, thus
speeding time-to-market.


- Real-time, integrated ETM 
trace data collection at 640 
MHz and a GByte of trace 
memory. 


* Highly integrated 
SourcePoint™ IDE with pow- 
erful, flexible code editing 
with debug integration. 


| + Real-time performance 
analysis for faster, more 
accurate results. 


- Fast, easy, intuitive run control with 
robust C-like command language 
facilities. 


¢ SourcePoint debugger available for 
Microsoft® Windows’ and Linux hosts. 


Ek american 
It's hard to compete without the right tools! (ANE rium 


American Arium * 14811 Myford Road ¢ Tustin, CA 92780 * 877.508.3970 * www.arium.com 
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An Entire Family of High-Speed, Low-Power Universal Intel® 
Pentium® M Processor Boards

The cPCI-6840 family combines Intel's latest high performance, low power processor, the Pentium® M, with the 855GME MCH and two different I/O controller hubs. The Pentium® M has a highly efficient instruction execution unit and, coupled with a large L2 cache, it can outperform other processors at the same clock speed. The 855GME has an integrated 250MHz 32-bit 3D/2D graphics engine capable of supporting simultaneous CRT and flat panel displays. The ICH packs a plethora of peripherals, including Serial ATA 150, PATA, USB 2.0, watchdog timer and serial port.

For more info, go to:
www.adlinktech.com/products


3U CompactPCI Board based on Intel® Pentium® M Processor with Intel® 855GME Chipset

The cPCI-3840 is a 3U CompactPCI Pentium® M processor board. Combined with embedded chipsets 855GME and 6300ESB, the cPCI-3840 offers up to 2GB DDR 333 memory with XVGA and flat panel graphics, dual Gigabit Ethernet ports, and several I/O features. It delivers higher computing power but at low power consumption. The cPCI-3840 is specially designed for industrial automation and control system applications.


ETXexpress Computer-on-Module with Intel® Pentium® M Processor

As the first member of ADLINK's ETXexpress family, the ETXexpress-IA533 uses a low power Intel® Pentium® M processor 760 at 2.0GHz and the Mobile Intel® 915GM Express Chipset. Both the chipset and processor are part of the embedded Intel® Architecture that ensures a long production life for applications that need extended availability. The ETXexpress-IA533 supports dual channel DDR2 533MHz memory and comes with a single on-board Gigabit Ethernet port. In addition to the onboard integrated graphics, a Graphic PCI Express x16 slot is also available. The board connects up to four additional PCI Express x1 devices. The module has legacy support for 32-bit PCI and ISA through LPC.

For more info, go to:
www.adlinktech.com/products


Small Package Loaded with Computing Power
A New 64-bit, 8-slot 6U backplane with Redundant Power Supplies in a 4U Enclosure

* Standard 6U CompactPCI and PICMG 2.5 H.110 CT Bus
* PICMG 2.1 Hot Swap compliant 64-bit 8-slot CompactPCI backplane with P3 & P5 rear I/O
* 2+1 Hot Swappable 500W+250W Redundant Power Supply with Universal AC input
* Guarded Power Switch and Reset Button
* Dual AC-inlet for Dual AC-input (DC input optional)
* LED Indication for Power Status, Fan Status, Temperature Alarm
* Embedded Chassis Management Tool to Monitor (fan, temperature & voltage) via RS-232 port


Full-size ePCI-X SBC featuring Intel® Pentium® 4 Processor with HT Technology

The NuPRO-850 features high computing capability and supports 800/533MHz FSB hyper-threading Pentium® 4. This product incorporates a PCI-X bus for 64-bit/66MHz performance. It has high bandwidth to support AGP8X performance VGA display and Serial ATA for high speed storage. The NuPRO-850 also supports USB 2.0 and generic features such as COM, KB, mouse and hardware monitoring.

For more info, go to:
www.adlinktech.com/products

Call us toll-free at (866) 4-ADLINK

ADLINK, Intel® Communications Alliance Associate Member
www.adlinktech.com


Data Acquisition


Standardizing Digital IF 
Data Transfer with VITA 49 


Intermediate frequency (IF) data normally passes between system elements
in analog format over coaxial cables. VITA is developing a new interconnect
standard for passing IF data between analog front-ends and DSP subsystems
in a digital, link-agnostic format.


by Stephen M. Pereira, Chairperson,
VITA 49 Working Group, Mercury
Computer Systems, Inc.


Many communications systems digitize incoming analog signal information with a high-speed A/D converter and then route the digitized information between system elements for processing and analysis. Typically, the digitized information is intermediate frequency (IF) data sent from a radio frequency (RF) downconverter to digital signal processing equipment or sent from digital signal processing equipment to an RF upconverter.

Until now, the interface for transmit- 
ting the digitized IF data stream between 
system elements over the communica- 
tions link has been application- and/or 
equipment-specific. Often, it has also 
been proprietary: the system’s digitiz- 
ing source packages the IF data into a 
unique, proprietary format, which the 
signal processing destination must know 
how to unpack (Figure 1). 

As a result, every time a source or destination component changes, the interface for passing digitized data between them also changes, and new software must be written to achieve or restore interoperability.


Standardizing the Digital IF
Interface

Standardizing the digital IF interface 
across receiver/transmitter equipment, 
signal digitization and conversion equip- 
ment and signal processing equipment 
would clearly benefit both OEMs and ven- 
dors. System manufacturers would no lon- 
ger have to rework their systems each time 
they upgraded a component. 

In addition, instead of being locked in to a particular vendor, OEMs could pick and choose the best component for the application at hand from a marketplace of interoperable products and essentially "plug and play" those components. Vendors would no longer be required to rewrite their digital IF interconnect logic to yet another data format, saving resources and shortening time-to-market. Standardizing digital IF could make system deployment faster and technology refresh easier for both system manufacturers and their vendors.

In 2004, Mercury Computer Systems and DRS Signal Solutions (DRS-SS) formed an informal industry group that solicited participation and input from the signal acquisition and processing community and its OEM customers for the development of such a standard. The informal industry group voted to associate with the VITA Standards Organization (VSO), and the VITA 49 Working Group was created to design a digital IF interface standard for adoption by VITA.

Figure 1: Notional Signal Receiving System (diagram labels: RF from antenna or signal generator, VME, fabric). In today's digital intermediate frequency (IF) interface, the boards on either side of the fiber/copper link provide the application-specific and generally proprietary logic that encodes and decodes the digitized data for transmission across the link.


The Digital IF Data 
Representation 

The Digital IF Data Representation 
(VITA 49) defines a data structure for the 
transmission of digital IF data between 
one or more sources and one or more des- 
tinations for both the receive and transmit 
paths. The Digital IF Data Representation 
is link-agnostic: it defines a methodology 
for representing digital IF data that can 
be layered on top of any transport proto- 
col and any physical communication link 
(Figure 2). 

The goal of the Digital IF Data Rep- 
resentation is to define a data structure 
that can be used by a sensor source to 
transmit digitized data to a signal process- 
ing destination, or by a signal processing 
source to transmit digital data to an emit- 
ter destination. Initially, it is focused on 
radio IF to convey digitized analog radio 
signals between RF communication re- 
ceivers/transmitters and digital process- 
ing devices (Figure 3). 

Although the Digital IF Data Repre- 
sentation is intended for use in both mili- 
tary and commercial applications, it is 
particularly targeted toward beam-form- 
ing and direction-finding signal-process- 
ing systems, as well as communications 
and signal intelligence (SIGINT) systems. 
Any communications system that needs to change an analog signal to digital information and then send it on for processing (such as police and fire department communications systems) is a candidate for the Digital IF Data Representation. Designers of electronic intelligence (ELINT) systems and software defined radio (SDR) systems may also find this standard useful.


Digital IF Data Representation 
Packet Types 

The Digital IF Data Representation uses a packet-oriented approach to specifying a data representation standard. The working group has discussed the need for three packet types. A streaming data packet defines the base structure for representing digital IF data, a source characteristics packet enables components in a distributed system to communicate their capabilities to each other, and a status change packet conveys changes in the system's state.

The Digital IF Data Representation 
basic standard (VITA 49.0) defines the 
streaming data packet. Extensions to this 
standard (VITA 49.x) will define the other 
two types. 


Streaming Data Packet Format 

The streaming data packet is de- 
signed to minimize transmission over- 
head and maximize its applicability to a 
broad range of applications. It consists of 
a header, a few optional header words, a 
variable-size data payload and a trailer 
(Figure 4). 
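
The four-part structure described above (header, optional header words, variable-size payload, trailer) can be illustrated with a minimal Python sketch. The field layout here is purely hypothetical and is not the VITA 49.0 bit assignment: it assumes 32-bit big-endian words, a header whose top bit flags an optional timestamp word, and a 20-bit payload length counted in words.

```python
import struct

WORD = struct.Struct(">I")  # assumption: 32-bit big-endian words


def pack_packet(payload_words, timestamp=None, trailer=0):
    """Build header + optional timestamp word + payload + trailer."""
    has_ts = timestamp is not None
    if len(payload_words) >= 1 << 20:
        raise ValueError("payload too large for the 20-bit length field")
    # Hypothetical header: bit 31 = timestamp present, bits 0-19 = payload length.
    header = (int(has_ts) << 31) | len(payload_words)
    words = [header]
    if has_ts:
        words.append(timestamp & 0xFFFFFFFF)
    words.extend(payload_words)
    words.append(trailer)
    return b"".join(WORD.pack(w) for w in words)


def unpack_packet(raw):
    """Parse a packet built by pack_packet; returns (timestamp, payload, trailer)."""
    words = [w for (w,) in WORD.iter_unpack(raw)]
    header = words[0]
    has_ts = bool(header >> 31)
    length = header & 0xFFFFF
    pos = 1
    timestamp = None
    if has_ts:
        timestamp = words[pos]
        pos += 1
    payload = words[pos:pos + length]
    trailer = words[pos + length]
    return timestamp, payload, trailer
```

Because the header carries the payload length, the same parsing code handles any payload size, including an empty one.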

Because the Digital IF Data Represen- 
tation has been designed with multi-chan- 
nel beam-forming and direction-finding 
applications in mind, the streaming data 
packet provides features that address the 
requirements of these applications. These 
include timestamp support, meta-data sup- 
port and system event support. 

Timestamp support is critical for synchronizing multiple channels of information in beam-forming and SIGINT applications.

Meta-data support enables a multi-
channel source to add meta-data with a 
channel number to a data sample and then 
interleave the samples in one data pay- 
load. The meta-data associated with the 
data sample, which contains the channel 
number, allows the destination to deter- 
mine to which channel a sample belongs 
when the destination unpacks the data. 
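
As a rough illustration of this tagging scheme, the Python sketch below interleaves samples from several channels into one payload and separates them again at the destination. The tag placement (channel number in the bits above a 12-bit unsigned sample) is an assumption made for the example, not the layout defined by the standard.

```python
from collections import defaultdict


def tag(channel, sample, sample_bits=12):
    """Place the channel number above the sample bits (hypothetical layout)."""
    return (channel << sample_bits) | (sample & ((1 << sample_bits) - 1))


def interleave(channels, sample_bits=12):
    """channels: list of equal-length sample lists; list index = channel number."""
    payload = []
    for frame in zip(*channels):
        for channel, sample in enumerate(frame):
            payload.append(tag(channel, sample, sample_bits))
    return payload


def deinterleave(payload, sample_bits=12):
    """Use each sample's meta-data to route it back to its channel."""
    mask = (1 << sample_bits) - 1
    out = defaultdict(list)
    for word in payload:
        out[word >> sample_bits].append(word & mask)
    return dict(out)
```

The destination never needs to know the interleaving order in advance; the channel number travels with each sample.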

System event support enables a source to communicate information about system events, such as an A/D converter overload or a radar antenna crossing north, to a destination. Events that affect the entire payload can be indicated, as can events that affect only a portion of the payload.

The packet definition also specifies 
a configuration key for linking a stream- 
ing data packet to other Digital IF Data 
Representation packet types that will be 
defined in the future. 

A streaming data packet can contain 
up to one million words of data payload. 
It allows an equipment manufacturer to 
encode digitized IF data samples of real 
(unsigned or signed) or complex format in 
a comprehensive range of widths, packed 
or unpacked, and tag the data samples (or 
not) for interleaving or other purposes. 
The manufacturer must specify how the 
data is formatted within the data payload, 
including data sample width, type, data 
packing method and whether or not the 
data samples have meta-data. 
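
To show why the sample width and packing method have to be published, here is a small Python sketch of one possible packing convention. Everything about the convention is an assumption for illustration: 32-bit words, unsigned samples, samples placed from the most significant end, and no sample straddling a word boundary. "Unpacked" is taken to mean one sample per word.

```python
def pack_samples(samples, width, packed=True, word_bits=32):
    """Pack unsigned samples into words, most significant slot first."""
    if not 1 <= width <= word_bits:
        raise ValueError("sample width must fit in one word")
    per_word = word_bits // width if packed else 1
    mask = (1 << width) - 1
    words = []
    for i in range(0, len(samples), per_word):
        word = 0
        for slot, sample in enumerate(samples[i:i + per_word]):
            shift = word_bits - width * (slot + 1)
            word |= (sample & mask) << shift
        words.append(word)
    return words


def unpack_samples(words, width, count, packed=True, word_bits=32):
    """Recover `count` samples; the caller must know width and packing."""
    per_word = word_bits // width if packed else 1
    mask = (1 << width) - 1
    samples = []
    for word in words:
        for slot in range(per_word):
            if len(samples) == count:
                return samples
            shift = word_bits - width * (slot + 1)
            samples.append((word >> shift) & mask)
    return samples
```

A destination that assumed a different width or packing method would read the same words and recover different numbers, which is why these parameters must be stated by the source manufacturer.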

The data payload can also be zero to enable the transmission of a header immediately followed by a trailer to communicate an event. The trailer specifies the type of event and the header can contain a timestamp that indicates when the event occurred.

Figure 2: VITA 49 Standard (diagram labels: "One or more sources, each with one or more channels"; "(Data Only)"; "One or more destinations"; "or multiple links"). The Digital IF Data Representation is a data representation standard that is layered on top of an existing transport protocol standard, such as Serial Front Panel Data Port (SFPDP), RapidIO, Ethernet or USB. These, in turn, are layered on top of an existing physical link protocol standard, such as fiber or copper.


And you thought SHE was sensitive?

What about your embedded system?

Just like sensing the pea under a mattress, your A/D and D/A system must detect and measure accurately too. So, start your system design with boards that handle real world irritants like temperature, drift and electrical noise.

Sleep easier knowing your measurements are accurate because the effective number of bits for the whole system is close to the ideal 12, 14, 16, or 24-bit resolution of the A/D silicon. Benefit from onboard fault protection circuits that ease system integration by handling the noise and power sequencing challenges common to multi-board designs. Eliminate the need for cumbersome and complicated layers of auto-calibration software with our low drift specs designed to keep your measurements constant over time.

And our wide selection of cost sensitive A/D and D/A boards is something that will make any princess happy.

We invite you to check out our 12, 14, 16 and 24-bit A/D and D/A single board computers from 12MHz to 533MHz, including VGA, CompactFlash®, Ethernet, GPS, CAN, wireless, and the operating system of your choice.

Call now and let our technical team help smooth out those sensitive issues in your system.

PC/104 24-bit A/D
Our MPC624 provides 24-bit A/D for PC/104 with four channels of instrument grade voltage measurement and/or four channels of direct connect to off-the-shelf sensors.

16-bit A/D, 14-bit D/A
Our headless Pentium, the SBC2596, offers Ethernet, GPS, CAN, DC/DC converter, and CompactFlash on EBX form factor with full 16-bit A/D and 14-bit D/A.

14-bit A/D, 14-bit D/A
Our SBC4495 is a 486/586 EPIC form factor with 14-bit A/D and D/A, VGA/Flat Panel, PCMCIA, wireless, and GPS.

For more fully-featured A/D and D/A SBC boards, visit our web site.


Implementing the Standard 

To implement the Digital IF Data 
Representation, the source equipment 
manufacturer creates the logic that en- 
codes the digitized data into streaming 
data packet format, while the destination 
equipment manufacturer creates the logic 
that reads and parses the incoming packet 
and hands off the data to the next process- 
ing stage for data payload decoding. 

Both source and destination equip- 
ment manufacturers generally embed the 
digital IF interface logic in FPGAs. The 
source equipment manufacturer must also 
publish its compliance with the Digital IF 
Data Representation’s data payload en- 
coding parameters, typically in the prod- 
uct’s data sheet. In addition, if the source 
equipment manufacturer uses meta-data 
and events, it must publish its meta-data 
and event definitions. 

Source and destination components 
using the Digital IF Data Representation 
are immediately interoperable for digi- 
tal IF transmission over communications 
links. Vendors never have to rewrite those 
products’ packing or unpacking logic. For 
example, while the logic in a destination 
component that decodes the data payload 
will change depending on the application, 
the logic that unpacks it will not. 


Demonstrating the Standard 

In January 2005, Mercury and DRS-SS began preparing a demonstration of the prototype VITA 49.0 Digital IF Data Representation, using a version of the streaming data packet format under design in the demo source equipment (DRS-SS) and destination equipment (Mercury). The companies then separately developed their respective source and destination digital IF logic based on these demonstration versions.

Figure 3: Notional System Using VITA 49 (diagram notes: one chassis requires one SBC, one VME bus; split chassis requires two SBCs, two separate VME buses). The Digital IF Data Representation passing from source to destination. The A/D converter and the digital IF logic are provided in a single board, the 3U/6U tuner component. Some signal processing logic could also be provided on this board.

In the demonstration configuration 
(Figure 5), two RF signal generators gener- 
ate an RF signal that continuously sweeps 
over a frequency band to two tuners. Each 
of these tuners converts the incoming RF 
to IF, digitizes it at 80 Msamples/s and en- 
codes the digitized IF data and timestamp 
into the prototype Digital IF Data Repre- 
sentation streaming data packet format. 
The tuners then transmit the streaming 
data packet output over 2.5 Gbit/s Serial 
Front Panel Data Port (SFPDP) fiber to 
the destination equipment chassis, which 
reads and parses the incoming streaming 
data packets. 

Meanwhile, a display computer connected to the source and destination equipment chassis via an Ethernet hub requests a snapshot of the outgoing data from the source equipment chassis approximately once per second.

A bit in the streaming data packet 
had been previously identified to mark 
a packet as a snapshot for display. When 
the tuners receive a request from the dis- 
play computer, they set this bit in the next 
packet they process to mark it as a snap- 
shot for the destination equipment. The 
tuners then send this packet to the display 
computer over the Ethernet connection 
and output it over the fiber. 

On the other end, the destination 
equipment chassis examines the snapshot 
bit in the incoming packets and sends the 
snapshot packets it receives over the Eth- 
ernet connection to the display computer. 
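
The snapshot mechanism amounts to setting and testing one flag bit, which the following Python sketch models. The bit position, the `Tuner` class and the `(header, payload)` tuple representation are all hypothetical stand-ins for the demonstration hardware, which the article does not describe at this level.

```python
SNAPSHOT_BIT = 1 << 30  # hypothetical position of the snapshot flag


def is_snapshot(header):
    return bool(header & SNAPSHOT_BIT)


class Tuner:
    """Source side: marks the next packet after a display request."""

    def __init__(self):
        self.snapshot_requested = False

    def request_snapshot(self):
        self.snapshot_requested = True

    def emit(self, header, payload):
        """Return (packet for the fiber, copy for the display or None)."""
        if self.snapshot_requested:
            self.snapshot_requested = False
            packet = (header | SNAPSHOT_BIT, payload)
            return packet, packet
        return (header, payload), None


def destination_snapshots(packets):
    """Destination side: forward only packets with the snapshot bit set."""
    return [p for p in packets if is_snapshot(p[0])]
```

Since the same marked packet goes to the display from both ends, a side-by-side comparison checks the link without either chassis needing to coordinate with the other.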

The display computer shows the snapshot data from the source equipment chassis and the destination equipment chassis in side-by-side windows. The source window shows what was transmitted over the fiber, while the destination window shows what the destination equipment received.

Figure 4: Designed to minimize transmission overhead and maximize its applicability to a broad range of applications, the streaming data packet structure is defined in the Digital IF Data Representation basic standard (VITA 49.0).

Figure 5: In the configuration for demonstrating the prototype VITA 49 Digital IF interface, DRS-SS supplied the RF signal generation and source equipment, while Mercury supplied the destination equipment.


The demonstration showed that the sent and received data was exactly the same: time-tagged digital IF data was moving across the fiber link from source to destination without becoming corrupted. The demonstration also illustrated the fact that Digital IF enables two chassis to interoperate despite potentially different software run-times and/or environments.


The Standard Today 
The VITA 49 working group has nearly completed the Digital IF basic standard (VITA 49.0), which contains the streaming data packet definition. The group was expected to submit the base standard for balloting in the fourth quarter of 2005. Extensions to the standard, such as the source characteristics and source status change packet definitions, are in the planning stages and will likely be released as future VITA 49.x versions.

The Digital IF Data Representation serves as a strategic technology that will enable OEMs to select processor-based DSP boards and digital receivers without being required to commit to a vendor-specific interface strategy.


Data Acquisition





Shared Memory Network
Targets Video-Centric Data
Acquisition


The entry of real-time video data capture technology into data acquisition 
and distribution systems is challenging design engineers to maintain the 
high throughput and low latency needed by large video streams. Distrib- 
uted shared memory networks not only fulfill those needs, but also allow 
remote placement of the data processing system. 


by Ralph Barrera, Curtiss-Wright Controls 
Embedded Computing, Data Communications 






A big change has taken place in data acquisition systems over the last several years, as the growing use of video data in imaging systems has increased the size and speed of the data streams these systems need to deliver and process.

The challenge for design engineers is 
to support the low-latency requirements 
of real-time data acquisition and distribu- 
tion, as well as provide the high through- 
put required to handle large video streams 
without dropping frames or diminish- 
ing the quality of the data. A distributed 
shared memory network can provide the 
low latency and high throughput speed 
needed for these more demanding imag- 
ing systems, while at the same time en- 
able remote placement of the processing 
system away from the frequently harsh 
factory floor environment. 

Formerly, traditional data acquisition systems (Figure 1) were typically tasked with relatively slow data rates of perhaps 10 to 100 measurements/second at the high end, since they conducted simple types of measurements, such as the displacement of an object, the object's acceleration or its temperature.

Because throughput requirements were fairly low, the data could be brought into the processing system via normal I/O channels using analog or discrete signals. The processing system itself could be built using simple A/D converters and discrete I/Os. A single processor could easily handle and operate on the low-level data rates required to measure the object's position, movement or size.

Figure 1: Control, Monitor, Analysis and Storage System (legend: video sensor, analog sensor, discrete sensor). Since traditional, centralized data acquisition systems conducted simple types of measurements, they were typically tasked with relatively slow data rates of, at most, 10 to 100 measurements/second. These systems could be easily built with a single processor, simple A/D converters and discrete I/Os.


Enter Video

Increasingly, however, the trend in today's data acquisition systems is to add video cameras to capture additional data and monitor operations. This trend is, in turn, driving a need for greater I/O throughput rates. On a typical manufacturing line today that produces any sort of component, video images are taken of the objects being produced. These images are used to perform a computer comparison, via a video link, between the object and a known good image, and to look for specific characteristics to determine the object's quality.

Since this use of video means a huge 
increase in data, instead of hundreds of 
samples a second there is now the equiva- 
lent of 20 Mbytes/s of data coming from 
the video and being sent to the computer. 
When a negative result emerges from the 
image comparison, the computer responds 
by sending control data to affect the manu- 
facturing process. For example, the system 
may divert the bad object to a dump bin. 

Unfortunately, factory floors can 
be hot, noisy, dirty and prone to large 
amounts of shock and vibration. This is 
a less than ideal environment for video- 
based processing systems. Another 
problem is the amount of cabling often 
required by video imaging systems. An 
imaging system that monitors multiple 
stages of a process with multiple cameras, 
distributed over tens or even hundreds of 
feet, can require significant amounts of 
physical cabling, which can create a po- 
tential hazard on a factory floor. 


The Standard Network 
Approach 

What’s needed is a high-speed I/O 
network that enables remote processing. 
Several attempts have been made to ad- 
dress the need for the high data through- 
put associated with video imaging, using 
standard networks such as Ethernet. Un- 
fortunately, it is not possible to adequately 
address these challenges with point-to- 
point message-based networks such as 
Ethernet. Although a Gigabit Ethernet 
network has sufficient bandwidth to han- 
dle one video data stream, it lacks the low 
latency required to handle closed-loop 
process control (Figure 2). 

Substantial processing power is also required by an Ethernet network just to provide the communications protocol. The source node must know all of the destination nodes and must specifically send a message to each of them. This requires modification of the source node's communications any time a new destination node is added, or if a task using the data is moved to another node. These interdependencies produce a network that is not easily scalable and does not easily accommodate growth. Although some new Ethernet switches do support multi-casting, performance and complexity would still be compromised.

Figure 2: The high data throughput needed for video imaging data acquisition produces challenges not adequately addressed by the point-to-point topology of an Ethernet network. Ethernet has enough bandwidth to handle one video stream but lacks the low latency required to handle closed-loop process control, and requires substantial processing power to provide the communications protocol.

[Diagram labels: Data Processing / Control / and/or Monitor Terminal]

Figure 3: A shared memory network utilizing a ring topology requires very little processor overhead, supports true data broadcasting and is destination-controlled. Every network node has access to all data, while network latency is minimized and throughput capability is maximized.

Figure 4: A shared memory network utilizing a high-throughput technology such as SCRAMNet GT can connect multiple processors in heterogeneous computers to form a single high-speed, low-latency, real-time distributed processing system. Up to 255 nodes on a network ring are supported, with a data throughput of up to 210 Mbytes/s.

Ethernet uses a source-controlled pro- 
tocol. This means that the source node con- 
trols which destination nodes have access 
to the data. If multiple nodes require such 
access, the source node simply sends the 
data to each of them. By sending the data 
multiple times the network bandwidth can 
be quickly used up. A single 20 Mbyte/s 
video source would require 40 Mbytes/s 
bandwidth if two nodes required the data. 
If another destination node were added, the 
bandwidth would become 60 Mbytes/s. 
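
The arithmetic in this paragraph is a simple multiplication, sketched below in Python. The function names are made up for the example, and the 125 Mbytes/s figure used in the usage note is the raw line rate of Gigabit Ethernet (1 Gbit/s divided by 8), ignoring all protocol overhead.

```python
def required_bandwidth(source_mbytes_per_s, destinations):
    """Bandwidth consumed when the source sends one copy per destination."""
    return source_mbytes_per_s * destinations


def max_destinations(link_mbytes_per_s, source_mbytes_per_s):
    """How many per-destination copies fit on a link of the given capacity."""
    return int(link_mbytes_per_s // source_mbytes_per_s)
```

For the 20 Mbyte/s video source in the text, `required_bandwidth(20, 2)` gives 40 and `required_bandwidth(20, 3)` gives 60, matching the figures above. Under the raw-rate assumption, `max_destinations(125, 20)` gives 6, an optimistic upper bound for a single Gigabit Ethernet link.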


The Shared Memory Network 
with Ring Topology 

To resolve these problems, a network 
is needed that requires very little proces- 
sor overhead, supports true data broad- 
casting and is destination-controlled. A 
shared memory network utilizing a ring 
topology (Figure 3) fulfills all of these re- 
quirements. It ensures that every network 
node has access to all data while it mini- 
mizes network latency and maximizes 
throughput capability to capture and dis- 
tribute the video imaging data along with 
the low-speed sensor data and commands. 
A shared memory network system also lets the data processing phase be handled remotely, away from the process that is being monitored and controlled. It therefore eliminates much of the cabling.

Using a shared memory architecture, data captured at multiple inspection stations can be fed to a number of processors, which can take the parts of the information they need and work on that data simultaneously. Another advantage of shared memory systems is that more computers can be easily added to the network as the system's requirements grow. A shared memory network also allows a heterogeneous mix of computers, so that a specialized high-speed video processing computer can be integrated into a network of commercial, low-cost PCs.

A shared memory network, in which 
each node, or computer, has an exact copy 
of the same data, enables all of the imag- 
ing system data to be distributed. Each 
computer on the network can dedicate its 
full processing capacity to a single dis- 
crete task while working on the same set 
of data. Changes to the test/manufacturing 
process—such as adding more sensors or 
changing the processing of the data—may 
be easily done by simply adding another 
computer or changing some routines in 
an existing computer. The data becomes 
available to all nodes without the need to 
change any of the network wiring. 
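
The idea that every node holds an exact copy of the same data can be modeled with a toy ring in Python. This is a deliberately simplified sketch and not how any particular product implements its protocol: it assumes a write is applied locally and then forwarded node to node exactly once around the ring, and the class and function names are invented for the example.

```python
class RingNode:
    """One computer on the ring, holding its own copy of the shared memory."""

    def __init__(self, node_id, size):
        self.node_id = node_id
        self.memory = [0] * size
        self.next = None


def build_ring(count, size):
    nodes = [RingNode(i, size) for i in range(count)]
    for i, node in enumerate(nodes):
        node.next = nodes[(i + 1) % count]
    return nodes


def write(origin, address, value):
    """Update the origin's copy, then pass the update once around the ring.

    Returns the number of other nodes that received the update.
    """
    origin.memory[address] = value
    updated = 0
    node = origin.next
    while node is not origin:
        node.memory[address] = value
        node = node.next
        updated += 1
    return updated
```

In this model the writer never names its destinations. Adding a node changes only the ring construction, not the code that produces the data.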


The SCRAMNet GT Shared 
Memory Architecture 

One example of a shared memory network architecture is Curtiss-Wright Controls' Shared Common RAM Network, Greater Throughput (SCRAMNet GT). SCRAMNet GT is a high-throughput technology for connecting multiple processors to form a single, real-time, distributed processing system in which memory is shared among the processors. It supports up to 255 nodes on a network ring with a data throughput of up to 210 Mbytes/s.

In a shared memory system the video 
data is sent out on the network ring only 
once. Because each node has access to all 
the available data, if a node selects to use 
and display that data it does so without 
affecting any other node. Conversely, if a 
node decides to drop off and not display 
any data, no other node is affected. The 
receiving station does not have to go back 
to the source and request that the desired 
data be sent again. 

This shared network system archi- 
tecture can be compared to a television 
station broadcast, which is unaffected by 
how many viewers are watching it at any 
given time. It is sent only once, and ad- 
ditional viewers do not affect the source. 
On the other hand, when a point-to-point 
network such as Ethernet distributes a 
Webcast, it must send the data individu- 
ally to every subscriber, significantly bur- 
dening network throughput. SCRAMNet 
GT supports a sustainable throughput rate 
of 210 Mbytes/s, which is comparable 
to Ethernet in a one-time, point-to-point 
connection. 

With Ethernet, however, if the data 
needs to be displayed in several different 
places, for example in three locations, then 
the entire data set has to be sent out three 
times, tripling the throughput. As through- 
put increases on the network it can become 
overburdened, resulting in delays and 
dropped frames. With video, this reduced 
information quality can result in unaccept- 
ably jerky images and lost data. Images 
that might be adequate for normal viewing 
may not be acceptable for a quality inspec- 
tion system using computer vision. 
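
The scaling difference can be put in numbers. The sketch below is illustrative only: the function names and the 50 Mbytes/s stream rate are assumptions chosen for the example, not measurements from a particular installation.

```python
def unicast_load(stream_mbytes_s, viewers):
    """Network load when every viewer receives its own point-to-point copy."""
    return stream_mbytes_s * viewers


def shared_memory_load(stream_mbytes_s, viewers):
    """Network load when the data is written to the ring once.

    `viewers` is deliberately unused: each node reads its local copy,
    so the load does not depend on how many nodes display the data.
    """
    return stream_mbytes_s


for n in (1, 3, 10):
    print(n, unicast_load(50.0, n), shared_memory_load(50.0, n))
```

With three display locations the point-to-point load is three times the stream rate, while the shared memory load stays at the stream rate.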


Remotely Located Data 
Processing 

To address the harsh factory floor 
environment, shared memory enables 
the data processing to be located re- 
motely from the data acquisition. After 
a computer node collects the data and 
pre-processes it, the data is placed in 
shared memory. With SCRAMNet GT, the 








distance between nodes can be quite high: 
using standard shortwave laser transceiv- 
ers, this distance can be as high as 200 to 
300 meters. With longwave transceivers, 
the distance between nodes can reach up 
to 10 kilometers. Cabling is also reduced 
because only a single fiber optic cable runs 
between each computer, as compared to a 
point-to-point network, where every cam- 
era and sensor must be wired individually 
back to a panel connected to the process- 
ing computers. 

Because the processing can be han- 
dled remotely, the transceivers on the fac- 
tory floor do not need to be integrated in 
high-speed processors, since all that is re- 
quired is the simple process of pulling in 
the data and putting it in shared memory. 
The high-performance processors can be 
placed remotely in a safe lab or control 
room environment. 

SCRAMNet GT is the latest version 
of the popular shared memory archi- 
tecture that was first introduced about 
15 years ago. It is the highest band- 
width shared memory system available. 
Although the original SCRAMNet 
had a 150 Mbit/s data rate and could 


transfer data around the network ring at 
20 Mbytes/s, SCRAMNet GT supports 
2.5 Gbit/s data rates and a throughput 
of 210 Mbytes/s with a latency of less 
than 0.5 microseconds per node. It fea- 
tures a one-to-many and many-to-many 
built-in broadcast capability and ensures 
that all nodes receive updated informa- 
tion without intervention from either host 
or user. The original system supported a 
maximum of 8 Mbytes of memory. Each 
SCRAMNet GT board, whether VME, 
PCI or PMC, comes with 128 Mbytes of 
memory (Figure 4). 

Another advantage of using shared 
memory is the low programming cost as- 
sociated with application programs. The 
system designer must assign data only 
to specific areas of shared memory. Ap- 
plication writers then use the data vari- 
ables corresponding to these addresses 
and use the variable names as they would 
normally. Tasks can be moved to other 
processors without any changes to the ap- 
plication itself. In actual practice, a task 
could be talking to another task within 
the same computer, or to a task on the far 
side of the ring. With shared memory, the 
sending task doesn’t need to know where 
the receiving task is located. 
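
The idea of assigning data to fixed areas of shared memory can be sketched as follows. This is a minimal illustration, not the SCRAMNet GT programming interface: the memory map, the variable names and the `bytearray` standing in for the replicated memory window are all assumptions.

```python
import struct

# Hypothetical memory map drawn up by the system designer:
# variable name -> (byte offset, struct format).
MEMORY_MAP = {
    "line_speed": (0, "<f"),    # 32-bit float
    "frame_count": (4, "<I"),   # 32-bit unsigned integer
}

# Stand-in for the memory window that the network hardware would replicate.
shared = bytearray(64)


def write_var(name, value):
    """Store a value at the address the memory map assigns to `name`."""
    offset, fmt = MEMORY_MAP[name]
    struct.pack_into(fmt, shared, offset, value)


def read_var(name):
    """Read a value back; the caller never sees where the writer runs."""
    offset, fmt = MEMORY_MAP[name]
    return struct.unpack_from(fmt, shared, offset)[0]
```

A task that calls `read_var("frame_count")` is written the same way whether the writer runs on the same computer or on the far side of the ring.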

As video-based imaging systems are 
being more widely deployed, and video 
resolution and speed increase, it is essen- 
tial to ensure that enough bandwidth is 
available. In December 2005, at the I/IT- 
SEC Conference in Orlando, Florida, Cur- 
tiss-Wright exhibited a SCRAMNet GT 
system with a total throughput load of 190 
Mbytes/s. Four video sources—two DVD 
players and two video cameras—were run 
from four nodes generating 50 Mbytes/s 
of streaming video data. Another task 
generating 120 Mbytes/s of additional 
data throughput was added to burden the 
system. The result was no video data degradation 
or lost frames. 
Data Acquisition 





Data Acquisition Systems 
Track Signal Processing 
Technology 


Signal processing systems are crunching ever larger amounts of sensor 
data, in turn demanding data acquisition and recording systems that 
can keep pace. Switched fabric interconnect and FPGA-based processing 
make possible the development of high-performance data acquisition 
and playback systems that reuse existing components and provide 
application-specific tailoring where it is really needed. 


by Andrew Reddig 
TEK Microsystems 


As processing capability continues to 
grow, signal processing systems are 

using ever larger amounts of sen- 
sor data—in resolution, bandwidth and 
number of channels—to perform their 
functions. Data acquisition and record- 
ing systems are required that can test and 
support these advanced signal processors’ 
capabilities. Fortunately, the same tools 
and technologies that enable faster signal 
processing—switched fabric interconnect 
and FPGA-based processing—can also be 
used to implement advanced data acquisi- 
tion systems for a wide range of applica- 
tions, including radar. 





Using a Network Model 

The primary mission of a data acqui- 
sition system is to acquire and store data, 
and lots of it. The first design parameter 
to consider is the amount of data that 
needs to be stored. If the application can 
be implemented with a single channel to 
disk, typically up to 200 Mbytes/s, the 
system can use either embedded storage 
technology or a PC-based data recorder. If 
the application requires multiple channels 
to disk, from 200 Mbytes/s up to several 
Gbytes/s, the system will typically use a 
switched fabric interconnect to provide 
both scalability and modularity. 

A variety of switched fabrics are avail- 
able with off-the-shelf support for modu- 
lar data recorders. Many legacy radar data 









[Figure labels: PMC / RACE++ 250 MB/s; PMC / VXS 1.0 GB/s; XMC / VXS 2.0 GB/s] 


acquisition systems use RACE++, which 
offers up to 533 Mbytes/s per 6U VME 
slot. Newer systems being developed to- 
day use VITA 41 (VXS) technology to 
scale up to 2.5 Gbytes/s per 6U slot. VXS 
systems can use fabrics such as PCI Ex- 
press or Serial RapidIO, or point-to-point 
links based on the Xilinx Aurora proto- 
col. The choice of protocol depends on the 
interoperability requirements within the 




Using a network model for the data acquisition system as a whole lets the 
system be viewed as a loosely coupled set of processing nodes, each with 
a PowerPC processor, local memory, I/O module site and bridge to the 
fabric. 





system and the complexity of the endpoint 
solution, which is typically implemented 
in an FPGA on each VXS card. 

One benefit of using a switched fabric 
is the built-in support for a network model 
for the system as a whole. The system can 
be viewed as a loosely coupled set of pro- 
cessing nodes, each with a PowerPC pro- 
cessor, local memory, I/O module site and 
bridge to the fabric (Figure 1). 

Nodes can be configured as either stor- 
age or I/O nodes, depending on the type 
of I/O module installed. Because each I/O 
module has its own dedicated processor, 
the software model is very simple. If a Fi- 
bre Channel module is installed, the node 
acts as a storage server, responding to cli- 
ent requests through the fabric network. 
Alternately, if an I/O module is installed, 
the node acts as both an autonomous I/O 
server and a storage client, managing its 
own I/O module and requesting storage to 
disk through the fabric network. 


High-Speed Fiber Optic Data 
Transfer 

In many radar applications, the sen- 
sor data being recorded is converted from 
analog to digital outside the recorder and 
is transferred using high-speed fiber optic 
interfaces. This approach makes it easy 
to insert a data recorder into the system 
without degrading the signal integrity of 
the data being acquired. The data recorder 
typically implements a copy mode that re- 
broadcasts the input data, allowing the re- 
corder to be inserted between the sensor 
and its signal processor without interrupt- 
ing the data flow. 

The most common format for high- 
speed fiber optic transmission is Serial 
FPDP, or ANSI/VITA 17.1. Serial FPDP 
supports 1.062, 2.125 or 2.5 Gbit/s physi- 
cal links, providing data rates of up to 247 
Mbytes/s per fiber. Serial FPDP is de- 
signed to be a simple, low-latency proto- 
col, making it well suited for FPGA-based 
implementations. 
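
The relationship between line rate and data rate can be checked with a short calculation. The sketch assumes 8B/10B line coding, in which every 8 data bits occupy 10 bits on the fiber; the function name is illustrative.

```python
def payload_ceiling_mbytes_s(line_rate_gbit_s):
    """Upper bound on payload rate for an 8B/10B-coded serial link, in Mbytes/s."""
    data_bits_per_s = line_rate_gbit_s * 1e9 * 8 / 10  # 8 data bits per 10 line bits
    return data_bits_per_s / 8 / 1e6


for rate in (1.062, 2.125, 2.5):
    print(f"{rate} Gbit/s link: {payload_ceiling_mbytes_s(rate):.1f} Mbytes/s ceiling")
```

Under that assumption a 2.5 Gbit/s link tops out at 250 Mbytes/s, so the quoted 247 Mbytes/s would leave roughly one percent for framing and other protocol overhead.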

In many radar data acquisition sys- 
tems, the building block that provides 
high-speed fiber interfaces is a PMC mod- 
ule, such as TEK Microsystems’ JazzFi- 
ber (Figure 2). Each of these modules 


provides four independent fiber optic in- 
terfaces connected to an onboard FPGA. 
The module also includes two banks of 
DDR buffer memory to support wire- 
speed buffering of all four data channels. 
When installed on a PCI-X carrier, the 
PMC module supports full throughput, 1 
Gbyte/s transfers between all four chan- 
nels and the host. 

The FPGA can be used to implement 
a wide range of protocols, including Serial 
FPDP, Fibre Channel and Gigabit Ether- 
net, allowing the same module to sup- 
port different types of interfaces through 
FPGA reconfiguration. Each processing 
chain is independent in the FPGA, allow- 
ing a single module to support a mix of 
protocols if required (Figure 3). 


Adjunct Data Processing 
Channels 

While the primary mission of a data 
acquisition system is to record high-speed 
sensor data, most systems also require 
some amount of adjunct low-speed data to 
be recorded as well. This low-speed data 
typically gives information about the plat- 
form itself, which provides the operating 
context necessary for analysis of the high- 
speed data. In some cases, the adjunct 
data affects the high-speed data record- 
ing process directly, modifying the type 
or amount of data being recorded in real 
time as the platform state changes. 
In many radar data acquisition 
systems, a PMC module, 
such as TEK Microsystems’ 
JazzFiber, delivers high- 
speed fiber interfaces. 
Four independent fiber 
optic interfaces connect to 
an onboard FPGA and two 
banks of DDR buffer memory 
support wirespeed buffering 
of all four data channels. 


While the adjunct data channels tend 
to be lower speed, they also tend to require 
some processing for interpretation and 
formatting of the data. Adjunct data chan- 
nels can use off-the-shelf interfaces—such 
as Ethernet, 1553, SCRAMNet and the 
like—or they may require tailored low- 
level interfaces such as serial or parallel 
TTL, ECL, EIA-485 or LVDS. As long 
as the interface can be implemented on 
a PMC or XMC I/O module, it can eas- 
ily be integrated into the data acquisition 
system. The network model enforces a 
modular approach to adjunct channels, 





A PMC module’s onboard FPGA can be reconfigured to implement 
protocols such as Serial FPDP, Fibre Channel and Gigabit Ethernet, 
allowing the same module to support different types of interfaces. Since 
each processing chain is independent in the FPGA, a single module can 
support a mix of protocols. 
allowing any customization or tailoring to 
affect only the interface in question, not 
the other building blocks of the system. 


Applying Effective, Accurate 
Timestamping 

Another common requirement for 
data acquisition systems is the need to 
accept an accurate timecode input and to 
apply a highly accurate timestamp to all 
of the data streams being recorded. Typi- 
cally, precise sample-to-sample timing is 
maintained in the sensor, but the timing 
of each packet of data, both high-speed 
and low-speed, is important for both data 
analysis and for precisely reproducing the 
data at a later time. 

Effective and accurate timestamp- 
ing requires a number of elements, both 
at the system level and at the individual 
processing node. First, the system needs 
a system-wide timebase that is distributed 
to all of the processing nodes and is ac- 
cessible to both hardware and software in 
each node. Second, the system needs an 
IRIG or other timecode input that can be 
precisely synchronized to the system-wide 
timebase and the results broadcast to all 
of the processing nodes. Third, each I/O 
channel needs a mechanism for applying 
the system-wide timebase to input events 
as close to the actual input as possible. 

In TEK Microsystems’ data acquisi- 
tion systems, the switched fabric intercon- 
nect is used along with hardware support 
on the carrier cards to implement a sys- 
tem-wide timebase. In RACE++-based 
systems, the timebase is based on the 
RACE++ XCLK and has a timing accu- 
racy of 15 nanoseconds. In VXS systems, 
the timebase is based on an adjunct ref- 
erence clock signal and has a timing ac- 
curacy of 8 ns. Hardware support on the 
carrier card allows IRIG or 1 pulse per 
second inputs to be precisely timestamped 
against the system-wide time. This allows 
software to broadcast timing reference 
points to the other processing nodes at a 
frequency of once per second. 
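
One way to picture the once-per-second reference point is as a pair: the timebase count latched at the pulse and the second it represents. The sketch below is an assumption-laden illustration, not TEK Microsystems’ implementation; the class, the field names and the 8 ns tick period are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ReferencePoint:
    """Broadcast once per second to every processing node (hypothetical layout)."""
    ticks_at_pps: int   # system-wide timebase count latched at the 1 PPS edge
    utc_seconds: int    # whole second decoded from the timecode input


def to_absolute_ns(event_ticks, ref, tick_ns=8):
    """Convert a raw timebase count into nanoseconds of absolute time.

    tick_ns is an assumed counter period, not a figure from the article.
    """
    elapsed_ticks = event_ticks - ref.ticks_at_pps
    return ref.utc_seconds * 1_000_000_000 + elapsed_ticks * tick_ns
```

Each node then needs only the latest reference point and its own copy of the timebase counter to timestamp an input event.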

Each processing node can perform 
timestamping in either hardware or 
software, depending on the capability 
of the I/O interface being implemented. 
Off-the-shelf I/O modules such as 1553 
and Ethernet do not support hardware 
timestamping. Instead, they use soft- 
ware-driven timestamping of messages 
Applying windowing to the high-speed data as early as possible in 
the processing chain maximizes efficient use of memory and fabric 
resources, as shown for a typical Serial FPDP processing chain for one of 
four channels. 


or packets for accuracies of 20 to 50 
microseconds. 

High-speed fiber optic interfaces 
using FPGA-based PMC modules can 
support hardware-based timestamping 
through the FPGA, using the system-wide 
timebase provided by the carrier on the 
PMC Pn4 connector. Each fiber optic in- 
put accesses the timebase at an early stage 
in the processing chain, providing a time- 
stamp with an accuracy of 100 ns or better 
per packet. 

During playback, the same FPGA- 
based approach is used to precisely repro- 
duce high-speed data, using the timestamp 
and the system-wide timebase to hold back 
transmission of each packet until the exact 
time it is required. For applications where 
such timing is critical, this allows the sys- 
tem to precisely reproduce the packet tim- 
ing that was recorded. 
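
The hold-back logic can be sketched in a few lines. In a real system this would run in the FPGA against the hardware timebase; the Python below only illustrates the arithmetic, and the function names are hypothetical.

```python
def release_times(recorded_ns, playback_start_ns):
    """Map recorded packet timestamps onto playback times, preserving spacing."""
    if not recorded_ns:
        return []
    first = recorded_ns[0]
    return [playback_start_ns + (t - first) for t in recorded_ns]


def ready(now_ns, release_ns):
    """A packet is held back until the timebase reaches its release time."""
    return now_ns >= release_ns
```

Because every release time is computed from the recorded timestamp, the gaps between packets on playback match the gaps that were recorded.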


Applying Static or Dynamic 
Windowing 

In some applications, the high-speed 
data input to the system contains a mix of 
critical and non-critical data. Because it is of- 
ten necessary to limit the number of attached 
RAID storage devices due to volume, weight 
or cost constraints, the application often must 
selectively decide to record or discard por- 
tions of the high-speed data streams based on 
the amount of available throughput. 

In some applications, the windowing 
algorithm is determined in advance as a 
part of the mission configuration. In other 
applications, the algorithm is driven by 
adjunct channel input during the mission 
and needs to be applied to the high-speed 
data in real time. 


The network model can be used to 
minimize the complexity of these dif- 
ferent implementations. The I/O server 
software that controls the high-speed data 
is designed to accept windowing param- 
eters from other processing nodes. If the 
windowing parameters are static, they 
are simply defined at the beginning of 
the mission and are not changed. If the 
windowing parameters are dynamic, they 
can be changed in real time by the adjunct 
channel processing node through a simple 
API call. 
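
A minimal sketch of such an interface follows. The class and method names are assumptions made for illustration; the article does not describe the actual API. The window is modeled as a range of sample indices within each frame.

```python
import threading


class WindowFilter:
    """Keeps only the samples of a frame whose index falls in [start, stop)."""

    def __init__(self, start, stop):
        self._lock = threading.Lock()
        self._start, self._stop = start, stop

    def set_window(self, start, stop):
        """Hypothetical API call: made once for static windowing,
        or repeatedly by the adjunct channel node for dynamic windowing."""
        with self._lock:
            self._start, self._stop = start, stop

    def apply(self, frame):
        """Return the portion of the frame that should be recorded."""
        with self._lock:
            start, stop = self._start, self._stop
        return frame[start:stop]
```

The I/O server code path is the same in both cases; only the number of `set_window` calls differs.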

To maximize the efficient use of 
memory and fabric resources in the sys- 
tem, it is usually best to apply windowing 
to the high-speed data as early as possible 
in the processing chain. For fiber optic 
data, the FPGA processing capability on 
the PMC module can easily accommodate 
either static or dynamic windowing as a 
part of the high-speed data processing 
chain (Figure 4). 


Industry Standard File System 

By building a data acquisition system 
using a network model, supporting a mix 
of high-speed and adjunct channels, and 
implementing timestamping and window- 
ing, the storage nodes are presented with 
the right data, along with enough addi- 
tional information to make that data use- 
ful. The user still needs to be able to ac- 
cess the recorded data, either for analysis, 
transcription or playback. Typically, data 
is recorded once but accessed many times 
after the mission is over, making the data 
format on disk a critical part of the overall 
usability and effectiveness of the data ac- 
quisition system. 








One approach to data formatting is 
to use a real-time implementation of the 
standard FAT32 file system for all data 
recording and playback. This file system 
is directly supported by Windows, Linux 
and Solaris workstations, allowing RAID 
storage arrays to be directly accessed by 
standard workstations without requiring 
special software or drivers. 

Each channel of data is written to 
its own file on the disk, with a common 
format for headers and other adjunct in- 
formation such as timestamps. The use of 
a standard file system and a common for- 
mat enables the development of a body 
of transcription and verification soft- 
ware that is common to a range of data 


acquisition systems and largely indepen- 
dent of the specific type of data being 
recorded. 
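
As an illustration of what a common record format makes possible, the sketch below packs and unpacks a record with a fixed header. The header layout, the `DREC` marker and the field order are invented for this example and do not describe any vendor's on-disk format.

```python
import struct

# Hypothetical header: 4-byte marker, channel id, 2 pad bytes,
# payload length, timestamp in nanoseconds (little-endian, 20 bytes).
HEADER = struct.Struct("<4sHxxIQ")


def pack_record(channel, timestamp_ns, payload):
    """Prefix a payload with the common header."""
    return HEADER.pack(b"DREC", channel, len(payload), timestamp_ns) + payload


def unpack_record(buf):
    """Recover channel, timestamp and payload from one record."""
    marker, channel, length, timestamp_ns = HEADER.unpack_from(buf)
    if marker != b"DREC":
        raise ValueError("not a record header")
    payload = buf[HEADER.size:HEADER.size + length]
    return channel, timestamp_ns, payload
```

A verification tool written against the header alone can walk any channel's file without knowing what kind of sensor data the payload holds.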

Signal processing systems today 
make use of switched fabrics to create 
modular, scalable solutions, using FPGA- 
based processors to perform processing 
at higher densities than is possible with 
general-purpose processors. The use of 
the same tools and techniques supports 
a scalable and flexible approach to build- 
ing data acquisition systems. Leverag- 
ing off-the-shelf hardware and software, 
along with tailoring when necessary 
through FPGA-based processing, allows 
the use of industry standard components, 
enclosures, backplanes, I/O modules and 
RAID disk arrays. This makes it possible 
to develop very high-performance data 
acquisition and playback systems, with 
application-specific tailoring where nec- 
essary, while reusing existing hardware, 
software and FPGA components for the 
majority of the system. 
Serial RapidIO Fabric Offers 
Robust Scalability and 
Performance 


Now joining the parallel specification, Serial RapidIO builds on 
compatibility and adds flexibility and scalability to a fabric 
technology that can span many interfaces and media. 


by Tom Cox 
RapidIO Trade Association 


RapidIO technology is a fast-growing 
interconnect and fabric standard for 
embedded systems, providing increased 
performance, improved efficiency 
and lower cost. RapidIO technology is 
supported by a broad ecosystem of leading 
vendors, with multiple vendors shipping 
production switches, endpoints, FPGAs, 
boards, software and systems. Serial RapidIO 
technology offers a high-speed physical layer 
that can be configured to match bandwidth 
requirements with different speed variants 
and numbers of lanes. 

Serial RapidIO builds on the communication 
industry’s common roadmap at 
the serial physical layer, using a variant of 
the IEEE 802.3 10 Gigabit Attachment 
Unit Interface (XAUI) today for 3.125 
Gbit/s. For future 5 and 6 Gbit/s versions, 
it is using a variant of the work done on 
the Optical Internetworking Forum’s (OIF) 
Common Electrical Interface (CEI). 

RapidIO architecture has no inherent 
limitations preventing it from scaling 
indefinitely into the future, following or 

The RapidIO protocol can be carried over both serial and parallel interfaces 
and is media-agnostic. Therefore, the serial specification is defined 
at the physical electrical layer and the rest of the specification is 
preserved at the higher levels. 
Table 1 

Parallel RapidIO 
Clock Rate    Peak 
250 MHz       16 Gb/s 
500 MHz       32 Gb/s 
750 MHz       48 Gb/s 
1 GHz         64 Gb/s 

Serial RapidIO (4-bit wide) 
Clock Rate    Peak 
1.25 GHz      8 Gb/s 
2.5 GHz       16 Gb/s 
3.125 GHz     20 Gb/s 


anticipating industry requirements. RapidIO 
technology has evolved over the past five 
years to a full system dataplane fabric, with 
extensions completed and in progress for: 
* RapidIO Flow Control Logical Layer 
  Extensions Specification 
* RapidIO Data Streaming Logical 
  Layer Extension Specifications 
  - Phase I: Encapsulation and Traffic 
    Management Framework 
  - Phase II: Advanced Traffic 
    Management 
* RapidIO Multicast Extensions Specifications 
* RapidIO Next Generation Physical 
  Layer Specifications 


A comparison of the data rates be- 
tween the different modes of Parallel and 
Serial RapidIO is given in Table 1. 
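
The peak figures in the table can be reproduced with simple arithmetic under a few assumptions that the table itself does not state: that the parallel interface transfers data on both clock edges, that the serial interface uses 8B/10B coding across four lanes, and that "peak" counts both directions of the link. The function names are illustrative.

```python
def parallel_peak_gbit_s(clock_mhz, width_bits=16):
    """Peak rate for the parallel interface.

    Assumes double data rate (two transfers per clock) and both
    link directions counted together.
    """
    per_direction = clock_mhz * 1e6 * 2 * width_bits
    return per_direction * 2 / 1e9


def serial_peak_gbit_s(line_rate_ghz, lanes=4):
    """Peak rate for the serial interface.

    Assumes 8B/10B coding (8 data bits per 10 line bits) and both
    link directions counted together.
    """
    per_direction = line_rate_ghz * lanes * 8 / 10
    return per_direction * 2


print(parallel_peak_gbit_s(250), serial_peak_gbit_s(3.125))
```

Under these assumptions 250 MHz in 16-bit mode gives 16 Gbit/s and 3.125 GHz over four lanes gives 20 Gbit/s, matching the table's peak column.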


Flexible Physical Interface 

The RapidIO logical packet descrip- 
tion is defined to be physical-layer-inde- 
pendent. This means that the RapidIO pro- 
tocol could be transmitted over anything 
from serial to parallel interfaces, from 
copper to fiber media. The first physical 
interface considered and defined is known 
as the 8- or 16-bit link protocol end point 
specification (8/16 LP-LVDS). This speci- 
fication is defined as having 8 or 16 data 
bits in each direction along with clock and 
frame signals in each direction. 

The 8/16 LP-LVDS interface is a 
source-synchronous interface. This means 
that a clock is transmitted along with the 
associated data. Source synchronous clock- 
ing allows longer transmission distances at 
higher frequencies. Two clock pairs are pro- 
vided for the 16-bit interface to help control 
skew. The receiving logic is able to use the 
receive clock for re-synchronization of the 
data into its local clock domain. 

Since the Serial RapidIO specification 
is only defined in the physical layer (Rapi- 
dIO technology defines the physical layer as 
the electrical interface and device-to-device 
link protocol), most of the controller remains 
the same. As a result, much of the design 
knowledge and verification infrastructure 
are preserved (Figure 1). This eases system- 
level switching between parallel and serial 
links. During the initial development stages 
of the Serial RapidIO specifications the 
designers decided to preserve as many of 
the concepts found in the RapidIO parallel 
specification as feasible. The parallel speci- 
fication includes the concept of packets and 
in-band control symbols. 

These were delineated and differenti- 
ated by both a separate frame signal and 
an “S” bit in the header. In the serial link 
specification this delineation is accomplished 
using spare characters (“K-codes”) 
found in the 8B/10B encoding technique. 
In this way, the sending device indicates 
to the receiving link partner the start of a 
packet, end of packet or embedded control 
symbol using these codes. 


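16-bit Mode (Parallel RapidIO) 
Clock Rate    Sustained, 32-byte Op    Sustained, 256-byte Op 
250 MHz       8 Gb/s                   15 Gb/s 
500 MHz       16 Gb/s                  30 Gb/s 
750 MHz       24 Gb/s                  45 Gb/s 
1 GHz         32 Gb/s                  60 Gb/s 

4-bit Wide (Serial RapidIO) 
Clock Rate    Sustained, 32-byte Op    Sustained, 256-byte Op 
1.25 GHz      4 Gb/s                   7.2 Gb/s 
2.5 GHz       8 Gb/s                   14.4 Gb/s 
3.125 GHz     10 Gb/s                  18 Gb/s 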

Comparison of data rates in different modes and clock frequencies between Parallel and Serial RapidlO. 


Comprehensive Link Protocol 

A unique feature of RapidIO technol- 
ogy is that packet transmission is managed 
on a link-by-link basis. In the past, with 
synchronous buses, a mastering device 
had to exchange handshake signals with 
the target device. These signals indicated 
whether a transaction was acknowledged 
and accepted by the target device. With an 
interface such as the one the RapidIO specification 
defines, it is not practical to rely on a 
synchronous handshake since the receive 
port of a link is decoupled from the send- 
ing port. Therefore, many interconnects 
have ignored this issue and rely on an 
end-to-end handshake to guarantee deliv- 
ery. However, this has the disadvantage of 
preventing precise detection and recovery 
of errors and forces far longer feedback 
loops for flow control. 

To address this issue RapidIO uses 
embedded control symbols for link-level 
communication between devices. Pack- 
ets are explicitly tagged between each 
link with a sequence number otherwise 
known as AckID. The AckID is inde- 
pendent of the end-to-end transaction ID. 
Using control symbols, the receiving de- 
vice indicates for each packet whether it 
has been received along with additional 
buffer status information. Receiving de- 
vices can immediately detect a lost packet, 
and through control symbols, can re-syn- 
chronize with the sender and recover 
it without software intervention. The 








receiving device then forwards the packet 
to the next switch in the fabric, and so on, 
until the packet reaches its final target. 

Serial RapidIO allows longer trans- 
mission distances and thus involves lon- 
ger loop latencies in providing feedback 
between the receiver and transmitter on 
a link. Consequently, the Serial physical 
layer specification increases the number 
of AckID values from 8 to 32. 
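
The role of the AckID can be illustrated with a small model of the sending side of one link. This is a simplified sketch, not the state machine defined in the RapidIO specification: the class and method names are hypothetical, and control symbols are modeled as plain method calls.

```python
from collections import OrderedDict


class LinkSender:
    """Tags packets with a wrapping AckID and keeps each one buffered
    until the link partner acknowledges it."""

    def __init__(self, ackid_count=32):
        self.ackid_count = ackid_count
        self.next_ackid = 0
        self.outstanding = OrderedDict()  # ackid -> packet, in send order

    def can_send(self):
        # An AckID must not be reused while its packet is unacknowledged.
        return len(self.outstanding) < self.ackid_count

    def send(self, packet):
        if not self.can_send():
            raise RuntimeError("all AckIDs outstanding; wait for acknowledgements")
        ackid = self.next_ackid
        self.outstanding[ackid] = packet
        self.next_ackid = (ackid + 1) % self.ackid_count
        return ackid

    def accepted(self, ackid):
        """Link partner accepted the packet: release the buffered copy."""
        self.outstanding.pop(ackid, None)

    def retry_from(self, ackid):
        """Link partner asked for a resend: return packets from ackid onward."""
        ids = list(self.outstanding)
        if ackid not in self.outstanding:
            return []
        return [self.outstanding[i] for i in ids[ids.index(ackid):]]
```

The model shows why a longer feedback loop calls for more AckID values: the sender stalls once every value is tied to an unacknowledged packet, so a larger pool keeps the link busy while acknowledgements are in flight.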

Additionally, the Serial RapidIO 
specification now defines a transmitter- 
controlled flow control scheme whereby 
the receiving port provides information to 
its link partner about the amount of buf- 
fer space it has available. With this infor- 
mation, the sending port can allocate the 
use of the receive buffers of the receiving 
port. The sending port does not have to be 
concerned that one or more of the packets 
shall be forced to retry. 
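
A minimal sketch of transmitter-controlled flow control, assuming the buffer status arrives as a simple count of free receive buffers. The class and method names are illustrative and not taken from the specification.

```python
class FlowControlledPort:
    """Sending port that tracks the receive buffers its link partner advertises."""

    def __init__(self):
        self.free_buffers = 0

    def on_buffer_status(self, free_buffers):
        """Called when the link partner reports its free buffer count."""
        self.free_buffers = free_buffers

    def try_send(self, packet, transmit):
        """Transmit only when a receive buffer is known to be free."""
        if self.free_buffers == 0:
            return False  # hold the packet instead of risking a retry
        self.free_buffers -= 1
        transmit(packet)
        return True
```

Because the sender spends a buffer before each transmission, a packet that leaves the port always has somewhere to land at the receiver.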


PCS and PMA Layers 

The Serial RapidIO specification 
uses a physical coding sublayer (PCS) and 
physical media attachment (PMA) sub- 
layer to organize packets into a serial bit 
stream at the sending side and to extract 
the bit stream at the receiving side. This 
terminology is adopted from IEEE 802.3. 

Besides encoding for transmission 
and decoding for reception, the PCS func- 
tion is also responsible for idle sequence 
generation, lane striping, lane alignment 
and de-striping on reception. The PCS 
uses 8B/10B encoding for transmission 
over the link. 

The PCS layer also provides the 
mechanisms for automatically determining 
the operational mode of the 
port as either 1-lane or 4-lane, and provides 
for clock difference tolerance between 
the sender and receiver without 
requiring flow control. The PMA function 
is responsible for serializing 10-bit 
parallel code-groups to/from a serial 
bit stream on a lane-by-lane basis. 
Upon receiving data, the PMA function 
provides alignment of the received bit 
stream to 10-bit code-group boundar- 
ies, independently on a lane-by-lane 
basis. It then provides a continuous 
stream of 10-bit code-groups to the 
PCS—one stream for each lane. The 
10-bit code-groups are not observable 
by layers higher than the PCS. 
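
Lane striping and de-striping can be illustrated on plain bytes. The sketch ignores 8B/10B encoding, idle sequences and lane alignment, all of which a real PCS handles, and the round-robin byte order is an assumption made for the example.

```python
def stripe(data, lanes=4):
    """Distribute bytes round-robin: byte 0 to lane 0, byte 1 to lane 1, and so on."""
    return [data[i::lanes] for i in range(lanes)]


def destripe(lane_data):
    """Reassemble the original byte order from aligned per-lane streams."""
    lanes = len(lane_data)
    total = sum(len(lane) for lane in lane_data)
    return bytes(lane_data[i % lanes][i // lanes] for i in range(total))
```

De-striping only works if the lanes are aligned with one another, which is why lane alignment appears alongside striping in the list of PCS responsibilities.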


Robust Electrical Interface 

Serial RapidIO uses differential cur- 
rent steering drivers based on those de- 
fined in the 802.3 XAUI specifications. 
This signaling technology was developed 
to drive long distances over backplanes. 

For Serial RapidIO technology, two 
transmitter specifications were designated: 
a short run transmitter and a long run 
transmitter. The short run transmitter is 
used mainly for chip-to-chip connections 
either on the same printed circuit board or 
across a single connector such as that for 
a mezzanine card. The minimum swings 
of the short run specification reduce the 
overall power used by the transceivers. A 
user can further reduce the power by low- 
ering the termination voltages. 

The long run transmitter uses larger 
“voltage swings” that are capable of driv- 
ing across backplanes. This allows a user 
to drive signals across two connectors and 
common printed circuit board material. 
To ensure interoperability between driv- 
ers and receivers of different vendors and 
technologies, AC coupling must be used 
at the receiver input. 

The engineer’s interconnect choices 
may include use of proprietary, home- 
grown technologies, legacy interfaces or 
application-appropriate emerging  stan- 
dard technologies. The three leading 
choices are Ethernet, PCI Express and 
RapidIO technology. While the three in- 
terconnect technologies have some simi- 
larities, they are quite different in terms 
of technical merit. In many cases they can 
be highly complementary in the overall 
system architecture landscape. 

RapidIO was designed specifically 
as a widely applicable, flexible, extensible 
system fabric for embedded infrastructure 
equipment including networking, storage 
and communication systems. PCI Express 
was formulated as an improvement on the 
Peripheral Component Interconnect bus, 
primarily for the commercial comput- 
ing market. Historically, PCI, because of 
its ubiquitous nature and the consequent 
economies of scale, has been adopted 
within embedded systems despite not nec- 
essarily providing optimum functionality. 
There may be a similar desire to force-fit 
PCI Express into applications beyond the 
intent of the architectural scope of that in- 
terconnect. However, this is likely to be 
at the expense of inferior functionality, 


reduced performance, non-standard bridg- 
ing and more complex system design than 
the adoption of an application-appropriate 
standard, such as RapidIO technology. 
Ethernet, developed for system-to-system 
local networks, requires heavy over-provisioning 
for embedded applications and 
lacks determinism, reliability and robust 
error handling. 

The RapidIO fabric provides a robust 
packet-switched system level intercon- 
nect. It provides a partitioned architecture 
that can be enhanced in the future. It en- 
ables higher levels of system performance 
while maintaining or reducing implemen- 
tation costs. A RapidIO end point can 
be implemented in a small silicon foot- 
print. Proven industry-standard signaling 
schemes (LVDS, XAUI) are used for the 
physical interfaces. Error management in- 
cludes the ability to detect multi-bit errors 
and survive most multi-bit and all single 
bit errors. Even with all these capabili- 
ties, the RapidIO protocol overhead and 
latency are comparable to current bus 
technologies and significantly better than 
local area network-based fabric technologies 
such as Ethernet. 
Storage Systems Merge 
into the Express Lane — 
PCI Express 


What was once the domain of PCI and PCI-X in storage has given way to 
PCI Express. PCIe is bringing benefits to storage-system controller boards 
as well as to the add-in cards for Fibre Channel, SCSI and SATA host bus 
adapters, and to RAID controllers. 


by Steve Moore 
PLX Technology 


Storage systems large and small have 
been adopting PCI Express (PCIe) 

technology as the interconnect stan- 
dard at the card-to-card level, reflecting 
a natural evolution from last year’s chip- 
to-chip level interconnect deployment 
of PCIe. There are many reasons for the 
transition, foremost among them being 
the availability of PCIe-based chipsets for 
storage-system controller applications. But 
beyond that, what is driving the transition 
is PCIe’s scalability of bandwidth, robustness 
and integrity of data transfer, reduced 
pin count of ASICs and simplified circuit 
board layout—all of which add up to sys- 
tems that are faster and less expensive. 

PCI Express in Storage System board.

Extending the bandwidth of storage controllers was easy back in the
32-bit PCI days. Throughput could be doubled by moving from PCI’s
33 MHz to its next-generation 66 MHz. It was doubled again by going
from 32-bit to 64-bit PCI, then doubled once more by moving from PCI’s
64-bit, 66 MHz to the 64-bit, 133 MHz performance of PCI-X. This worked
well enough in getting throughput to one gigabyte per second. However,
the next jump in performance meant either doubling the bus width to
128 bits or doubling the bus rate to 266 MHz. Both have serious
drawbacks. First, 128-bit bus signals would require excessive board
space for the traces and make for high-pin-count chips, adding cost,
footprint, power dissipation and noise with all those I/Os switching.
Secondly, cranking the bus frequency up to 266 MHz increases the effect
of clock skews and makes it extremely difficult to meet the system
timing requirements. This is where a serialized architecture makes
sense.
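
The doubling steps described above can be checked with simple arithmetic. The following is a minimal sketch, assuming a parallel bus that moves one full-width word per clock and using nominal clock values (33.33, 66.66 and 133.33 MHz); the function and variable names are illustrative only.

```python
def parallel_bus_peak_mbytes(width_bits: int, clock_mhz: float) -> float:
    """Peak rate in Mbytes/s for a bus that transfers one word per clock."""
    return width_bits / 8 * clock_mhz


# Nominal clock values are an assumption made for this illustration.
GENERATIONS = [
    ("PCI 32-bit, 33 MHz", 32, 33.33),
    ("PCI 32-bit, 66 MHz", 32, 66.66),
    ("PCI 64-bit, 66 MHz", 64, 66.66),
    ("PCI-X 64-bit, 133 MHz", 64, 133.33),
]

if __name__ == "__main__":
    for name, width, clock in GENERATIONS:
        rate = parallel_bus_peak_mbytes(width, clock)
        print(f"{name}: about {rate:.0f} Mbytes/s peak")
```

Each step roughly doubles the previous one, and the last entry lands just above 1 Gbyte/s, which is the ceiling the article describes before a wider or faster parallel bus becomes impractical.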


Scalability and Reliability with 
PCI Express 

Storage interconnects such as U320 
SCSI, SAS, SATA2 and Multi-Gigabit Fi- 
bre Channel are delivering performance 
improvements with increased connection 
speeds. PCIe offers the ability to deploy 
multi-port adapters and RAID controllers 
without creating a local I/O bottleneck. 

Adding PCI Express to PCI-X System Boards Requires Reverse Bridging. (Diagram labels: Host CPU, PCI-X-to-PCI Express Bridge.)

This U320 SCSI host adapter from PLX has a PCI-to-PCI Express Bridge added to a PCI-X HBA.

As storage interconnects transition to 10 Gbit/s speeds, PCIe can
provide the basis for a long-term roadmap for storage-performance
capability. InfiniBand connections enabled for PCIe deliver data
centers a full 10 Gbits/s of clustering connectivity today. Table 1
shows the performance scalability as a function of lane count for
2.5 GHz (Gen 1) PCIe. With Gen 2 (5 GHz) on the horizon, these
bandwidths will double, from 8 Gbytes/s to 16 Gbytes/s per direction
for a x32 link, for an aggregate maximum bandwidth of 32 Gbytes/s.
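
The scaling with lane count can be reproduced in a few lines. This is a sketch under stated assumptions: each lane signals at 2.5 Gbits/s (Gen 1) or 5 Gbits/s (Gen 2) per direction, and the 8b/10b line coding used by both generations leaves 8 data bits out of every 10 transmitted. Function names are illustrative.

```python
def pcie_mbytes_per_direction(lanes: int, lane_mbits_raw: int = 2500) -> int:
    """Usable Mbytes/s in one direction after 8b/10b coding overhead."""
    raw_mbits = lanes * lane_mbits_raw
    data_mbits = raw_mbits * 8 // 10   # 8b/10b: 8 data bits per 10 line bits
    return data_mbits // 8             # bits to bytes


def pcie_mbytes_aggregate(lanes: int, lane_mbits_raw: int = 2500) -> int:
    """Both directions added together (the links are full duplex)."""
    return 2 * pcie_mbytes_per_direction(lanes, lane_mbits_raw)


if __name__ == "__main__":
    for lanes in (1, 4, 8, 16, 32):
        gen1 = pcie_mbytes_per_direction(lanes)
        gen2 = pcie_mbytes_per_direction(lanes, 5000)
        print(f"x{lanes}: Gen 1 {gen1} Mbytes/s, Gen 2 {gen2} Mbytes/s per direction")
```

With 32 lanes the sketch gives 8,000 Mbytes/s per direction for Gen 1 and 16,000 Mbytes/s for Gen 2, or 32,000 Mbytes/s aggregate, matching the figures quoted in the text.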

Table 1 row labels: Link Width; Bandwidth in Gbits/s (raw, aggregate); Bandwidth in Gbytes/s (aggregate); Bandwidth in Gbytes/s (per direction).

Diagram label: PCI Express Add-On Slot.

When the system software finds a PCIe bridge or switch, it looks just
like a PCI bridge; no software changes or drivers are required to
accommodate PCIe. From the viewpoint of the system model, each PCIe
port is a virtual PCI-to-PCIe device and has its own set of PCIe
configuration registers. It is through the upstream port that the BIOS
or host can configure the other ports using standard PCI enumeration.
The virtual PCI-to-PCIe bridges within PCIe switches and bridges are
compliant with the PCI and PCIe system models. The Configuration Space
Registers (CSRs) in a virtual primary/secondary PCI-to-PCIe bridge are
accessible by type 0 configuration cycles through the virtual primary
bus interface, matching bus number, device number and function number.
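
Because each switch port appears to software as a virtual bridge, ordinary depth-first PCI enumeration can number the buses behind it. The sketch below is a simplified model of that idea, not the BIOS algorithm of any particular system: the `Device` class, its field names and the two-port switch topology are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Device:
    name: str
    is_bridge: bool = False
    children: List["Device"] = field(default_factory=list)
    primary: Optional[int] = None      # bus on the upstream side
    secondary: Optional[int] = None    # bus directly below the bridge
    subordinate: Optional[int] = None  # highest bus number below the bridge


def assign_bus_numbers(bridge: Device, primary_bus: int, next_free_bus: int) -> int:
    """Depth-first numbering; returns the next unused bus number."""
    bridge.primary = primary_bus
    bridge.secondary = next_free_bus
    next_free_bus += 1
    for child in bridge.children:
        if child.is_bridge:
            next_free_bus = assign_bus_numbers(child, bridge.secondary, next_free_bus)
    bridge.subordinate = next_free_bus - 1
    return next_free_bus


def example_switch() -> Device:
    """A switch modeled as one upstream and two downstream virtual bridges."""
    down0 = Device("downstream port 0", True, [Device("endpoint A")])
    down1 = Device("downstream port 1", True, [Device("endpoint B")])
    return Device("upstream port", True, [down0, down1])


if __name__ == "__main__":
    switch = example_switch()
    assign_bus_numbers(switch, primary_bus=0, next_free_bus=1)
    for port in [switch] + switch.children:
        print(port.name, port.primary, port.secondary, port.subordinate)
```

The upstream port ends up owning the internal virtual bus, and each downstream port owns the bus leading to its slot, which is why existing PCI software can walk the hierarchy unchanged.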

PCIe provides a more robust inter- 
connect compared with PCI-X. This en- 
hances system reliability and data avail- 
ability. Since PCIe is a point-to-point 
architecture, it eliminates the shared bus 
that is used in PCI-X. With a shared bus, 
it is never clear how many devices may re- 
side and what impact the various devices 
may have on the bus bandwidth. But with 
a dedicated channel for each endpoint, 
quality of service and throughput can be 
deterministic. 

In the PCIe architecture, the Data Link Layer (Layer 2) provides link
management and ensures data integrity using error detection and
correction. This layer calculates and appends a Cyclic Redundancy
Check (CRC) and assigns a sequence number to each data packet it
sends. The sequence number allows proper ordering of the data packets.
The CRC verifies that data from link to link has been correctly
transmitted. In addition, the PCIe specification allows for providing
end-to-end CRC protection (ECRC) and poison-bit support to enable
designs to guarantee error-free packets. While these features are
optional in the PCIe specification, they are already integrated in
some vendors’ PCIe devices.
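
The sequence-number-plus-CRC scheme can be illustrated with a small framing sketch. It is an assumption-laden model rather than the PCIe wire format: `zlib.crc32` stands in for the link CRC, the 12-bit sequence number is carried in a plain 16-bit field, and the function names are hypothetical.

```python
import struct
import zlib
from typing import Optional

SEQ_MODULO = 4096  # sequence numbers are 12 bits wide


def frame_packet(seq: int, payload: bytes) -> bytes:
    """Prefix a sequence number and append a 32-bit CRC over both."""
    header = struct.pack(">H", seq % SEQ_MODULO)
    crc = zlib.crc32(header + payload)
    return header + payload + struct.pack(">I", crc)


def check_packet(frame: bytes, expected_seq: int) -> Optional[bytes]:
    """Return the payload if the CRC and sequence number check out, else None.

    A real link would answer a failed check with a NAK and a replay.
    """
    body = frame[:-4]
    (crc,) = struct.unpack(">I", frame[-4:])
    if zlib.crc32(body) != crc:
        return None
    (seq,) = struct.unpack(">H", body[:2])
    if seq != expected_seq % SEQ_MODULO:
        return None
    return body[2:]


if __name__ == "__main__":
    frame = frame_packet(7, b"disk block")
    print(check_packet(frame, 7))
    damaged = bytearray(frame)
    damaged[3] ^= 0x01  # flip one bit in the payload
    print(check_packet(bytes(damaged), 7))
```

The receiver accepts a packet only when both checks pass, so a corrupted or out-of-order packet is caught at the link where it occurred rather than further along the path.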
PCIe includes a hot plug capability, allowing users to replace add-in
cards and other hardware modules to perform maintenance without
powering down the system. Each downstream port includes a standard hot
plug controller. If the PCIe switch is used in an application where
one or more of its downstream ports connect to PCIe slots, each port’s
hot plug controller can be used to manage the hot plug event of its
associated slot. Furthermore, its upstream port is a hot plug client,
allowing it to be used on hot-plug-capable adapter cards, backplanes
and fabric modules.


System Board Uses 
PCI Express 

The earliest deployment of PCIe has been for chip-to-chip interconnect
on system boards, as shown in Figure 1. In this example, a dual-host
system’s CPUs and memory chips are interconnected via a PCIe root
complex. A root complex is a specialized PCIe switch. The root complex
is also connected to a standard PCIe switch to provide several
channels of I/O fan-out. In this example, the standard PCIe switch
fans out the root complex to a PCIe bridge, switch and a native
endpoint. The PCIe bridge allows the creation of PCI/PCI-X slots for
various I/O functions including legacy adapter cards and
communications ports. The second-level standard PCIe switch provides
fan-out for several PCIe slots. The PCIe endpoint could be any number
of PCIe devices, such as PCIe-native Gigabit Ethernet controllers or
adapters.

Some designs may involve a system board with plenty of performance,
but feature only a PCI-X interconnect and need to be able to connect
to PCIe add-in cards. A PCIe-to-PCI-X bridge allows the migration of
an existing PCI-X storage system board to accept PCIe add-in cards.
This allows the creation of a PCIe system board without having to
qualify a completely new chipset. The catch here is that the bridge
must be capable of reverse-bridging mode, which isn’t found on all
PCIe bridges. With reverse mode, the PCI-X port is the upstream port,
and the PCIe port is the downstream port (Figure 2).

Link Width                               x16   x32
Bandwidth in Gbytes/s (aggregate)          8    16
Bandwidth in Gbytes/s (per direction)      4     8

Legend: PCI 32/66; PCI-X 64/133.

Table 1 PCI Express Scalable bandwidth (Gen 1, 2.5 GHz PHY). The yellow box indicates the equivalent maximum bandwidth for a
32-bit, 33 MHz PCI bus; the green, 32-bit, 66 MHz and the magenta, 64-bit, 133 MHz PCI-X functionality.

Non-Transparent Bridging between PCI and PCIe.

Most of today’s high-performance 
storage cards use PCI-X. Many such 
cards have a single-chip implementation 
to create a one- or two-port adapter card, 
and they use several adapter ICs to cre- 
ate multi-port adapter cards. A PCI-X-to- 
PCIe bridge allows the rapid deployment 
of PCIe connectivity, as shown in Figure 3. 
If a multi-port solution is desired, multiple 
adapter devices can be connected on the 
PCI-X bus, but the maximum bus rate will 
depend on the total number of PCI-X de- 
vices on the bus. Many PCI-X devices are 
capable of handling multiple secondary 
devices at frequencies less than 133 MHz. 
One word of caution: Since the data flows through a PCI-X bus and
PCIe link, the maximum throughput will be limited by the latency of
the bridge. This constriction is a function of the transaction layer
packet (TLP) sizes and how many resultant retries are required, but it
is unlikely that a full 1 Gbyte/s will be achieved with a x4 PCIe
link.
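
One part of that shortfall, the per-packet overhead, is easy to estimate. The sketch below assumes Gen 1 framing with a three-doubleword header: 1 start byte, 2 sequence-number bytes, 12 header bytes, 4 link-CRC bytes and 1 end byte around each payload. It deliberately ignores bridge latency, retries and flow-control traffic, so it gives an upper bound rather than a prediction.

```python
# Per-TLP overhead assumed here: start + sequence number + 3-DW header + LCRC + end.
TLP_OVERHEAD_BYTES = 1 + 2 + 12 + 4 + 1


def tlp_efficiency(payload_bytes: int) -> float:
    """Fraction of link bytes that carry payload."""
    return payload_bytes / (payload_bytes + TLP_OVERHEAD_BYTES)


def effective_mbytes_per_s(lanes: int, payload_bytes: int) -> float:
    """Payload throughput in one direction of a Gen 1 link (250 Mbytes/s per lane)."""
    return lanes * 250 * tlp_efficiency(payload_bytes)


if __name__ == "__main__":
    for payload in (64, 128, 256):
        rate = effective_mbytes_per_s(4, payload)
        print(f"x4 link, {payload}-byte payloads: about {rate:.0f} Mbytes/s")
```

Even before the bridge adds latency, a x4 link carrying 128-byte payloads delivers noticeably less than its nominal 1 Gbyte/s, and smaller packets fare worse.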

Intelligent adapters, including RAID 
controllers, employ a processor on the card 
in order to offload the host. Non-trans- 
parent bridges have been used in PCI- and 
PCI-X-based cards to achieve domain iso- 
lation between the host and the processor 
on the card. A PCIe card employs a bridge 
or a switch with non-transparent bridging 
for this purpose (Figure 4). 

A multiprocessor architecture can 
provide high system reliability. After 
enumeration by the primary processor, a 
secondary host monitors the system state, 
including the health of the primary processor.
If the primary processor fails, then
the secondary processor assumes control
without bringing down the system. This 
failover operation requires a non-transpar- 
ent bridge in a PCI-X environment. With 
PCIe, a non-transparent switch such as the 
PLX PEX 8532 provides the high-speed 
interconnection and domain isolation re- 
quired for dual-host operation. 

PCIe has emerged as the foremost interconnect standard for
chip-to-chip connections and add-in cards. It provides several
advantages over its predecessors while maintaining software
compatibility and performance scalability. PCIe’s long list of
features that build on the PCI and PCI-X legacy (hot plug, ECRC,
quality of service, deterministic bandwidth and a generically more
robust interconnect based on a point-to-point topology) is enabling
robust storage systems for real-time and mainstream applications.
Dual-Core Processing Drives
High-Performance Embedded
Systems


Dual-core processors such as the AMD Opteron can overcome the problems 
associated with high-performance single-core CPUs, while delivering 
performance increases. Combined with improved microarchitecture, 
multithreading and HyperTransport connectivity, this technology is being 
harnessed to the needs of demanding embedded applications. 


by Matt Stevenson and John Hill 
WIN Enterprises 


Dual-core CPUs have been commercially available since 2000,
when IBM first introduced the
IBM POWER4. They provide a method
for gaining greater performance while 
avoiding the increases in form-factor, in- 
cremental heat and power requirements 
associated with the higher feature density 
of fast single-core processors. In pursuing 
dual-core CPUs, the major IC manufactur- 
ers have acknowledged that the historical 
approach of gaining performance by sim- 
ply increasing CPU feature density has 
reached diminishing practical returns. 
The current generation of high-per- 
formance CPUs (Table 1) is 90 nanome- 
ters (nm) between surface features, thus 
entering the realm of bona fide nano- 
technology, which is 100 nm and below. 
However, at this extreme density, there 
are many unwanted effects. The industry 
has grown accustomed to ever improving 
performance with each CPU generation,
but the current level of miniaturization of 
feature sizes is forcing IC manufacturers 
to look to more innovative solutions. 

The problems caused by extreme feature density are interrelated.
Electrical features in extreme proximity produce quantum effects,
i.e., electrons that randomly tunnel across the CPU’s features causing
interference with normal signal transmission. At the highest
frequencies, tunneling can become so extreme that it totally negates
signal recognition.

Table 1 Semiconductor Technology Generations by Feature Size
(averaged across vendors)

Year    Feature size (nm)    Note
1982    1,500
1993    600
1998    250
1999    180
2001    130
2003    90                   current generation
2005    65                   just beginning to appear

To drive high performance across 
smaller, more powerful transistors re- 
quires more power. In turn, higher power 
results in unacceptable levels of waste 
heat as power (wattage) increases and 
produces more unwanted quantum ef- 
fects. Machines with dense CPUs run- 
ning at higher wattage are noisier, be- 
cause they require additional, more pow- 
erful fans for cooling. Fan motors add yet 
more electrical noise. 

Dual-core processors, such as the 
AMD Opteron, can mitigate these prob- 
lems while at the same time enabling sig- 
nificant increases in performance. 


The AMD Opteron Dual-Core 
Processor 

The AMD Opteron processor is a high-density, 90-nm CPU, packing 233
million transistors on a 199 mm² die. The chip is microarchitected to
lessen unwanted effects, principally through thread-level parallelism.
It uses other technology, such as HyperTransport interconnect, in
order to work smarter, not hotter.

Dual-core processors are most effec- 
tive in applications that feature highly par- 
allel processes. However, the technology 
can realize significant gains when applied 
to nearly any application that involves 
all but the simplest sequential number 
crunching. IBM, which has incorporated 
the dual-core AMD processor in some of 
its servers, reports 60% faster processing 
with a 2.2 GHz dual-core AMD Opteron 
processor versus AMD’s 2.6 GHz single- 
core processor in tests using the Linpack 
HPL benchmark. Other tests, such as 
floating-point and integer processing, have 
yielded even better gains (Figure 1). 

This increase in performance has 
generated interest among OEMs designing 
embedded systems for demanding, low- 
latency, high-performance applications, 
such as industrial automation, military, 
medical and security imaging, storage and 
telecommunications. The dual-core AMD 
Opteron processor enables basic reference 
designs that can be modified to meet these 
systems’ needs for compactness, design 
longevity, lower power consumption, low 
latency and high reliability, often in harsh 
environments. 

In response to market forces and 
evolving technology in x86 processors, 
such as the AMD Opteron and Pentium M, 
many designers of high-performance em- 
bedded systems are turning from highly 
specialized platforms to x86-based solu- 
tions. These systems typically run either 
Windows Embedded XP or Linux. 

Regardless of the operating system 
chosen, it should be dual-core-aware in 
order to provide the benefit of multi- 
threading. The dual-core AMD Opteron 
provides improved 32-bit legacy appli- 
cation support, in addition to concur- 
rent 64-bit performance. This ability to 
support legacy applications enables a 
smooth upgrade path in the enterprise 
market and expands the dual-core Opter- 
on’s flexibility in the high-performance 
embedded market. 

Terascala, which manufactures storage appliances for Linux-based
clusters, is utilizing the dual-core Opteron CPU on motherboards
co-designed and manufactured by WIN Enterprises. The combination of
HyperTransport connectivity, improved microarchitecture and dual-core
CPUs enables these storage systems to provide the high performance,
scalability and high I/O throughput required by their enterprise
customers.

Sandra CPU Floating Point Benchmark, Floating-Point x8 iSSE2 (it/s).
Processors compared: AMD Opteron 165 [2 Core] 1.8GHz 2x1ML2;
2x AMD Opteron 244 1.8GHz 1ML2; 2x AMD Opteron 252 2.6GHz 1ML2;
2x AMD Opteron 265 [2 Core] 1.8GHz 2x1ML2;
2x AMD Opteron 280 [2 Core] 2.4GHz 2x1ML2.
Labeled results: 2x Opteron 252: 60099; 2x Opteron 280 [2 Core]: 110952.

In floating-point performance tests (iterations per second) conducted at
WIN Enterprises, the 2.4 GHz dual-core AMD Opteron 280 shows an 85%
improvement over the 2.6 GHz single-core Opteron 252. This improvement
can be attributed to the dual-core processor’s ability to multithread its tasks.

Single-Core Opteron and Dual-Core Opteron: The dual-core AMD Opteron
processor utilizes the same basic architecture
as the single-core Opteron, but reduces board-level footprint. The two
cores connect to a common crossbar that manages processing tasks and
a dedicated L2 cache for each core provides scalability.

The MB-06047 EBX SBC from WIN Enterprises contains a low-power
dual-core AMD Opteron CPU with a PCI Express slot, a CompactFlash
socket, an ExpressCard socket, 4x SATA and a stackable HyperTransport
connector.

Since Terascala rack-mounts several 
storage units into its cabinets, the benefits 
of a high-performance processor with a 
smaller footprint and less waste heat are 
especially important in serving data stor- 
age application needs. In addition, trans- 
action-intensive storage environments 
require the dual-core architecture’s low 
latency, which approaches real-time per- 
formance (Figure 2). 


Multithreading 

Multithreading divides a program into concurrent tasks that run
across the two processing cores, enabling parallel processing. This
results in more efficient processing and system resource utilization.
An AMD Opteron dual-core design with two 2.2 GHz cores on a single die
can outperform a 2.6 GHz single-core CPU because the dual cores can
efficiently divide their processing tasks. This is true even though
the clock frequency of the dual-core solution is lower than that of
the single-core solution in order to control the dual-core CPU’s level
of waste heat production. However, even with multithreading
techniques, higher performance is not a given.
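
Why the gain is not a given can be seen with Amdahl's law. The sketch below is a rough model, assuming that performance scales linearly with clock frequency and that a fixed fraction of the work can be split evenly across cores; neither assumption comes from the article, and memory and cache effects are ignored.

```python
def relative_performance(parallel_fraction: float,
                         multi_ghz: float = 2.2,
                         single_ghz: float = 2.6,
                         cores: int = 2) -> float:
    """Multicore throughput divided by single-core throughput (Amdahl's law)."""
    clock_ratio = multi_ghz / single_ghz
    amdahl_speedup = 1.0 / ((1.0 - parallel_fraction) + parallel_fraction / cores)
    return clock_ratio * amdahl_speedup


def break_even_fraction(multi_ghz: float = 2.2,
                        single_ghz: float = 2.6,
                        cores: int = 2) -> float:
    """Parallel fraction at which the slower multicore part merely ties."""
    return (1.0 - multi_ghz / single_ghz) / (1.0 - 1.0 / cores)


if __name__ == "__main__":
    for fraction in (0.0, 0.25, 0.5, 0.9, 1.0):
        print(f"parallel fraction {fraction:.2f}: "
              f"{relative_performance(fraction):.2f}x the single-core part")
    print(f"break-even parallel fraction: {break_even_fraction():.2f}")
```

Under these assumptions the 2.2 GHz dual-core part only pulls ahead once roughly a third of the workload runs in parallel; purely sequential code runs slower on it than on the 2.6 GHz single-core part.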

The new x86 microarchitecture of 
the dual-core AMD Opteron processor is 
highly sophisticated. For instance, it fea- 
tures an integrated DDR memory control- 
ler. On-chip local memory in the form of 
L2 cache eliminates the need for the CPU 
to constantly fetch processing loads from 
RAM, as would be necessary with tradi- 
tional Northbridge bus architectures. 

Some of the Northbridge functional- 
ity, such as the memory controller, is de- 
signed into the CPU for greater through- 
put, resulting in a low-latency intercon- 
nect. This compares to the traditional 
Northbridge/Southbridge bus architecture, 
which can gate high system performance. 
This design innovation in microarchitec- 
ture is a major reason for the performance 
gains of both single- and dual-core AMD 
Opteron processors. 


HyperTransport Technology 

Originally begun at AMD, Hyper- 
Transport technology is a major advance- 
ment in chip-to-chip and board-to-board 
interconnection, and an important en- 
abling technology in multicore CPUs, 
which is being applied to both commer- 
cial and embedded computing by several 
manufacturers. 


A high-speed, low-latency technology, HyperTransport enables
significant increases in communication speed between chips and I/O
functions. It is scalable and can be used in expanding a dual-core
design into a quad-core design through the use of stackable extension
boards.

HyperTransport is both competitive 
with, and complementary to, PCI Ex- 
press. Either can be used for both chip- 
to-chip and board-to-board intercon- 
nection, and they can be deployed either 
exclusively or together, depending on 
the application. A typical dual-core 
CPU design interfaces HyperTransport
with PCI Express. This takes advantage 
of PCI Express’ support for a wide se- 
lection of chips as well as HyperTrans- 
port’s throughput performance where 
it counts most, allowing embedded de- 
signs to be optimized for both function 
and performance. 

HyperTransport features two unidirectional, point-to-point,
high-speed connections that integrate chips, boards and other bus
structures. The technology is also used in the integration of the DDR
memory controller with the CPU, enabling it to reside on the same die.
Separate HyperTransport links serve the I/O functions.


Applying Dual-Core Technology 
to High-Performance 
Embedded Designs 

WIN Enterprises was one of the 
first board vendors to apply dual-core 
AMD Opteron technology to the needs 
of high-performance embedded com- 
puting. Working closely with AMD, the 
company had to innovate in order to 
solve several different problems in ap- 
plying the technology to mobile servers, 
imaging devices and databank manage- 
ment applications. 

First, an appropriate form-factor had 
to be decided upon. After evaluating a 
range of formats, including mini ITX, 
EBX, PICMG 1.3, ETX and EPIC, an 
EBX SBC was selected (Figure 3), since 
it is increasingly sought by designers of 
high-performance embedded systems, 
partly for its small size. 

WIN decided to populate the EBX form-factor of the MB-06047 SBC with
state-of-the-art components. These included dual-core Opteron
processors with HyperTransport, PCI Express, USB 2.0, ALC850 audio and
Gigabit Ethernet.

In designing this board, the nVidia 
nForce 2200 chipset was chosen to work 
with the AMD CPU. However, the two 
had never been used together in a small 
form-factor, which presented some de- 
sign challenges. The successful mating 
of dual-core and small form-factor was 
a breakthrough for the embedded OEM 
market. 

Other challenges were overcome by 
designing a 10-layer motherboard rather 
than utilizing the traditional 6 layers. 

Nextcom, a leading manufacturer 
of extreme performance, mobile, small- 
footprint computing products, is utiliz- 
ing the Opteron dual-core CPUs on a 
related design, the MB-06048, which 
is a PICMG 1.3 form-factor. This is be- 
ing used in a field-rugged, mobile data 
communications server used by military 
and government agencies. The advanced 
SBC enabled Nextcom to respond to 
market requirements for a distributed 
computing appliance that integrates 
legacy technology, performs multiple 
processes simultaneously, utilizes the 
advantages of COTS technology and al- 
lows application customization as mar- 
ket needs evolve. 

The high-performance comput- 
ing power of both single- and dual-core 
AMD Opteron processors is being lever- 
aged in Nextcom’s field-deployable units, 
the FleXtreme Vigor and NextDimension 
products. These small units top out at 2.6 
GHz per processor. They use the stack- 
able HyperTransport extension boards to 
offer quad-core CPU processing capabil- 
ity to military, government agency and 
other customers. 

A WIN PICMG 1.3 reference design 
is also being used by a major workstation 
vendor in its medical imaging solution. 


Software Considerations 

Software is increasingly a concern 
in high-performance embedded designs, 
and that usually means Linux. To com- 
plement its efforts in high-performance 
small form-factor designs, WIN devel- 
oped its own standard BIOS, as well as 
a downloadable Linux image for product 
testing and a Linux SDK. 


Dual-core CPU technology is at the 
leading edge of high-performance em- 
bedded designs. Dual-core and Hyper- 
Transport technology enable a significant 
advance as a standardized x86-based plat- 
form that fulfills the requirements of low 
latency and high performance in a small 
form-factor. This approach is seeing a
high level of OEM interest, evaluation and
application.
System Tracing Tools
Ease Transition to
Multicore Processors


Although multicore processors can deliver higher performance per watt 
and true concurrency, they require different programming models from 
those used for uniprocessors. System tracing tools can ease the transition 
to multicore processors by simplifying troubleshooting and design 
optimization, as well as aid the process of migrating legacy code to 
multicore hardware. 


by Derrick Keefe and David Inglis 
QNX Software Systems 


Faced with growing energy consumption and excessive operating
temperatures caused by high CPU clock speeds, microprocessor vendors
have adopted a new approach to boosting system performance:
integrating multiple, independent processor cores on a single chip.
Intel, for example, has proclaimed that all of its new CPUs will use
multicore architectures and recently produced a roadmap that details
processors based on two, four and eight cores.

Multicore processors are taking root in embedded designs, with the
introduction of chips such as the dual-core Freescale MPC8641D
(Figure 1), the dual-core Broadcom BCM1255, the quad-core Broadcom
BCM1455 and the dual-core PMC-Sierra RM9000x2. Processors like these
will play a big role in embedded applications, especially in
networking and communications systems, which have long pushed the
limits of conventional uniprocessor technology.

Multicore chips such as the Freescale MPC8641D dual-core PowerPC
processor offer much better performance per watt than existing
uniprocessor designs, as well as truly concurrent multi-tasking.

Compared to uniprocessor chips, 
multicore processors offer several advan- 
tages, including true concurrency and 
greater performance per watt. The prob- 
lem is, most embedded software design- 
ers and engineers have little or no exper- 
tise in the programming models used for 
multicore chips. Instead of relying on in- 
creasing clock speeds to achieve greater 
performance, they must now learn how 
to achieve the highest possible utilization 
of every available core. Without the right 
tools, however, it can be extremely difficult 
to assess whether maximum utilization 
has, in fact, been realized. 

Migrating legacy code to multicore 
hardware is also an issue. In a uniproces- 
sor system, the OS automatically serializes 
the operation of applications. Multiple tasks 
may appear to run simultaneously, but in 
fact only one task runs at any point in time. 
In a multicore system, multiple tasks really 
do run concurrently, and this can quickly 
accentuate any incorrect assumptions that 
an application makes about access to shared 
system resources. Consequently, an appli- 
cation that runs perfectly in a uniprocessor 
system may suddenly behave incorrectly 
when deployed in a multicore environment. 
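
A shared counter is the classic case of such an assumption. The sketch below uses Python threads purely as an illustration (embedded code would more likely use C with the operating system's own primitives); the class and function names are hypothetical. The read-modify-write in `increment` is only safe when nothing else can run in the middle of it, which stops being true once threads execute on separate cores, so the update is guarded explicitly.

```python
import threading


class SharedCounter:
    """A counter that stays correct when incremented from several threads."""

    def __init__(self) -> None:
        self.value = 0
        self._lock = threading.Lock()

    def increment(self) -> None:
        # Without the lock, two threads could read the same old value
        # and one of the two updates would be lost.
        with self._lock:
            self.value += 1


def run_workers(counter: SharedCounter, workers: int = 4, iterations: int = 10_000) -> int:
    """Increment the counter from several threads and return the final value."""
    def work() -> None:
        for _ in range(iterations):
            counter.increment()

    threads = [threading.Thread(target=work) for _ in range(workers)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return counter.value


if __name__ == "__main__":
    print(run_workers(SharedCounter()))
```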

To address these issues, vendors such 
as QNX Software Systems have introduced 
system tracing tools. These provide a com- 
prehensive view of a multicore system, al- 
lowing the developer to visualize detailed 
system interactions and pinpoint potential 
bottlenecks such as excessive message pass- 
ing between cores (Figure 2). Such tools 
can also help analyze resource utilization, 
including CPU usage, on a per-application 
basis and suggest how to distribute applica- 
tions across cores for optimal performance. 


Reducing Excessive IPC 
Between Cores 

Excessive message passing between cores can seriously impede system
performance in a multicore design. Every time a core sends a message,
it must write the message to memory and send an interrupt to the
receiving core. The receiving core must then service the interrupt,
schedule the software process or thread that will handle the message
and read the message from memory. Together, these operations entail
considerable overhead, especially since they might not use information
stored in the processor cache. As a result, processes or threads that
communicate frequently with one another across cores (for example, a
process that provides services to client applications) can consume a
noticeable amount of system capacity.

System-tracing view and traditional debugger view: Analyzing the
system-level interactions among the multiple software
components of multicore systems is beyond the scope of traditional
debuggers, especially since many of these components communicate
across cores. Consequently, developers need system-tracing tools that
can analyze how the multicore system behaves as a whole.

A tool that performs system-level trac- 
ing is invaluable in identifying this kind of 
behavior. Such a tool can gather a massive 
amount of system information, including 
hardware interrupts, kernel calls, schedul- 
ing events, thread-state changes and vari- 
ous forms of interprocess communication 
(IPC), such as signals and messages. To 
be useful, however, the tool must allow 
the developer to filter out everything ex- 
cept for the information relating to inter- 
core communications, in order to quickly 
see which threads are communicating be- 
tween cores and identify which cores are 
executing a given thread over time. 
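
The filtering step described above can be pictured as a pass over the trace log. This is a minimal sketch with an invented event layout; the `TraceEvent` fields and the `"message"` event kind are assumptions for illustration and do not reflect the log format of any real tracing tool.

```python
from collections import Counter
from typing import Iterable, NamedTuple


class TraceEvent(NamedTuple):
    timestamp_us: int
    kind: str        # e.g. "message", "interrupt", "kernel_call"
    src_thread: str
    src_core: int
    dst_thread: str
    dst_core: int


def inter_core_message_counts(events: Iterable[TraceEvent]) -> Counter:
    """Count messages per (sender, receiver) pair that crossed a core boundary."""
    counts: Counter = Counter()
    for event in events:
        if event.kind == "message" and event.src_core != event.dst_core:
            counts[(event.src_thread, event.dst_thread)] += 1
    return counts


if __name__ == "__main__":
    log = [
        TraceEvent(10, "message", "client", 0, "server", 1),
        TraceEvent(12, "interrupt", "irq", 1, "server", 1),
        TraceEvent(15, "message", "client", 0, "server", 1),
        TraceEvent(20, "message", "logger", 1, "server", 1),
    ]
    for pair, count in inter_core_message_counts(log).most_common():
        print(pair, count)
```

Sorting the pairs by count puts the chattiest cross-core conversations at the top, which is the information needed to decide which threads to examine first.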

To further isolate the problem, the tool can provide statistics such
as interrupts per thread, which help identify threads that cause
excessive inter-core communication. The tool could also display CPU
utilization to help the developer examine core utilization from a
system perspective, a process perspective and a thread-priority view.
A system perspective helps identify potential problem areas, such as
whether a core is underutilized or 100% utilized. A process
perspective helps the developer drill down to see what threads are
doing in a problem area. A thread-priority view aids in detecting
whether certain threads are getting too much or too little CPU time
because of their assigned priority.

Using this information, the frequency 
and efficiency of inter-core communica- 
tions can be gauged. From there, the de- 
veloper can make an informed decision 
about how to correct the problem. For in- 
stance, processes that have a high affinity 
for one another can be bound to the same 
processor core. Doing so would reduce 
the overhead of inter-core communica- 
tions and eliminate the attendant cache 
thrashing that degrades performance. The 
same system-level tracing tool could then 
be used to profile the new configuration 
and measure the efficiencies gained. 
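
How a process is bound to a core depends on the operating system. As one hedged illustration, Linux exposes the affinity mask to Python through `os.sched_setaffinity`; the helper name below is hypothetical, and other operating systems (including the RTOS discussed in this article) provide their own mechanisms, which are not shown here.

```python
import os


def pin_current_process(core: int) -> bool:
    """Restrict the calling process to one core; return False if not possible.

    Uses the Linux-only os.sched_setaffinity call, so the helper reports
    failure on platforms where that call is unavailable.
    """
    if not hasattr(os, "sched_setaffinity"):
        return False
    if core not in os.sched_getaffinity(0):
        return False  # the core does not exist or is not allowed for this process
    os.sched_setaffinity(0, {core})
    return True


if __name__ == "__main__":
    # Two cooperating processes would each call this with the same core number.
    print(pin_current_process(0))
```

Two processes pinned to the same core exchange messages without the cross-core interrupt and cache traffic described earlier, at the cost of no longer running in parallel with each other.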


Finding Opportunities for 
Parallelism 

To maximize application performance on multicore processors, the
developer must take advantage of the hardware parallelism offered by
these chips. First, it must be determined where parallelism will have
the biggest payoff. Starting with a system-tracing tool, processes or
threads within processes that consume large amounts of CPU time can be
singled out. Once a particular process or set of processes has been
identified, an application profiler can be deployed. This analyzes
function-level performance within individual processes to determine
which code inside a process or thread is consuming the most CPU cycles
(Figure 3).

Using an application profiler, a
developer can quickly pinpoint
compute-intensive functions,
in this case, nanospin_clock().
An application profiler
can also display call-tree
information, which helps
identify execution paths that
consume the greatest number
of CPU cycles.

Once compute-intensive functions 
have been identified, the developer can 
apply a parallelization strategy. For instance,
if a compute-intensive algorithm
has independent steps, breaking the algo- 
rithm into multiple functions and spawn- 
ing those functions as separate threads 
can improve performance in a multicore 
symmetric multiprocessing (SMP) sys- 
tem. This focused approach to parallel- 
ization using system-tracing tools helps 
ease the transition to multicore platforms 
and enables applications to realize the in- 
creased throughput and performance that 
these architectures promise. 
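
Splitting an algorithm with independent steps across threads can look like the following sketch. It uses Python's `concurrent.futures` for brevity, and the sum-of-squares workload is a made-up stand-in for a compute-intensive function. Note the caveat: in standard CPython, threads do not execute bytecode on two cores at once, so this shows the structure of the decomposition rather than a measured speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Sequence


def sum_of_squares(chunk: Sequence[int]) -> int:
    """One independent step: it depends only on its own slice of the data."""
    return sum(x * x for x in chunk)


def parallel_sum_of_squares(values: List[int], workers: int = 2) -> int:
    """Split the input into one chunk per worker and combine the partial sums."""
    size = max(1, -(-len(values) // workers))  # ceiling division
    chunks = [values[i:i + size] for i in range(0, len(values), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(sum_of_squares, chunks))


if __name__ == "__main__":
    print(parallel_sum_of_squares(list(range(10)), workers=2))
```

The key property is that the chunks share no state, so the workers never need to synchronize until the final combination step.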


Eliminating Resource 
Contention 

The first thing a developer may no- 
tice after moving software to a multicore 
environment is that performance doesn’t 
increase as much as expected. In many 
cases, this slower performance results 
from resource contention. 

By allowing processes to run concurrently, a multicore processor can
expose resource contention issues never encountered on a uniprocessor
system. For instance, a common SMP bottleneck occurs when two or more
threads share a data structure protected by a mutual exclusion lock,
or mutex. A mutex protects a resource by preventing multiple threads
from accessing the resource at the same time. Thus, if two threads on
separate cores both access resources locked by the mutex, those
threads can spend considerable time contending for the lock. Instead
of running concurrently with one another, which is the main benefit of
a multicore design, the threads must take turns executing.

System-tracing tools such as QNX Momentics can be used to optimize
performance in a multicore system. The Multicore Info panel (top left) reveals
where the greatest amount of core-to-core communication and thread
migration is occurring, while the panel at top right identifies when a given
thread is migrating from one core to another. The Overview/CPU Activity panel
(bottom) shows which part of the system tracing log file is being analyzed.

50 January 2006

One example of resource contention is 
the routing table in a networking applica- 
tion. In a uniprocessor environment, only 
one process can access the routing table at 
a time. In a multicore environment, on the 
other hand, threads on each core can access 
the table simultaneously, thereby creating 
contention. Being able to identify the source 
of such contention can save a developer a 
significant amount of time and frustration. 
In fact, eliminating contention can have a 
huge performance payoff, even in systems 
that appear to run acceptably fast. 
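
The contention described above can be made visible with a small sketch. The Python example below, written for illustration only, guards a shared table with a single mutex and counts how often a thread finds the lock already held. The class `LockedTable` and the helper `run_workers` are hypothetical names, and the contention count varies from run to run.

```python
import threading


class LockedTable:
    """A shared table guarded by one mutex (hypothetical example type)."""

    def __init__(self):
        self._lock = threading.Lock()
        self.entries = {}
        self.contended = 0  # how often a thread found the lock already held

    def put(self, key, value):
        # Try the lock without blocking first so contention can be counted.
        if not self._lock.acquire(blocking=False):
            self._lock.acquire()   # another thread holds it: wait our turn
            self.contended += 1    # safe to update: we now hold the lock
        try:
            self.entries[key] = value
        finally:
            self._lock.release()


def run_workers(table, workers=2, updates=10_000):
    """Have several threads update the same table concurrently."""
    def work(worker_id):
        for i in range(updates):
            table.put((worker_id, i), i)

    threads = [threading.Thread(target=work, args=(w,)) for w in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return table


table = run_workers(LockedTable())
print(len(table.entries), "entries;", table.contended, "contended acquisitions")
```

Every update is correct, but each contended acquisition is time a thread spent waiting instead of running.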

To help identify resource contention, 
a well-designed system-tracing tool may 
provide several features. For instance, it 
can highlight processes that are frequently 
ready to run but are blocked, generate sta- 
tistics for threads that are blocked because 
of resource contention caused by threads 
on other cores and provide a graphical rep- 
resentation of core-to-core messaging. 

Other features—such as a search fa- 
cility for finding specific system events 
and a timeline that graphically displays 
the flow of execution with high-resolution 
timestamping—allow high-runner cases to 
be examined and the root cause of the con- 
tention to be pinpointed. Armed with this 
information, the developer can decide how 
best to optimize the system. For instance, 
the application can be divided into more 
threads to further parallelize the computation.
Alternatively, the source of contention
can be removed by, for example, replicat- 
ing the resource across cores. 
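
One way to picture replicating a resource is sketched below, under the assumption that the shared resource is a table of per-destination hit counts. Each worker updates a private replica, and the replicas are merged once at the end, so no lock is taken in the hot loop. The function name and data layout are hypothetical.

```python
import threading


def count_hits_replicated(chunks):
    """Count hits per destination using one private table per worker."""
    partials = [dict() for _ in chunks]   # one replica per worker, no lock needed

    def work(index, destinations):
        local = partials[index]           # only this worker touches this replica
        for dest in destinations:
            local[dest] = local.get(dest, 0) + 1

    threads = [threading.Thread(target=work, args=(i, chunk))
               for i, chunk in enumerate(chunks)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # Merge step: runs once, after all workers have finished.
    merged = {}
    for partial in partials:
        for dest, hits in partial.items():
            merged[dest] = merged.get(dest, 0) + hits
    return merged


print(count_hits_replicated([["10.0.0.1", "10.0.0.2"], ["10.0.0.1"]]))
```

The trade-off is extra memory and a merge step in exchange for removing the shared lock.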


Using Performance Counters to 
Pinpoint Bottlenecks 

Some multicore processors allow doz- 
ens of performance metrics to be captured, 
which can be used to isolate performance 
bottlenecks. These metrics range from ex- 
tensive cache metrics to simple cycle counts, 
that is, the number of CPU clock cycles that 
occur between reads of a counter. Although a 
large number of different metrics can potentially
be counted, the hardware usually provides
a limited number of counter registers
to do so. Because each counter is capable
of counting one of many possible metrics, 
configuring a given counter for a particular 
metric can be a daunting task. 

A well-designed system-tracing tool 
can address this problem by “pre-configur- 
ing” high-runner metrics. For instance, the 
tool can highlight excessive cache misses, 
which can indicate unnecessary thread mi- 
gration from one core to another. The tool 
can also highlight bus stalls, which suggest 
that two concurrent threads are running in 
a non-optimal manner. 
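
To make the idea of a pre-configured metric concrete, the sketch below derives a cache-miss ratio from two successive counter readings and flags it against a threshold. The `CounterSample` fields, the sample values and the 10% threshold are all assumptions chosen for illustration; real counter names and sensible thresholds depend on the processor and the tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CounterSample:
    """One reading of three hypothetical hardware counters."""
    cycles: int
    cache_accesses: int
    cache_misses: int


def miss_ratio(before, after):
    """Cache-miss ratio for the interval between two counter reads."""
    accesses = after.cache_accesses - before.cache_accesses
    misses = after.cache_misses - before.cache_misses
    return misses / accesses if accesses else 0.0


def is_excessive(before, after, threshold=0.10):
    """Flag the interval when the miss ratio exceeds an assumed threshold."""
    return miss_ratio(before, after) > threshold


start = CounterSample(cycles=1_000, cache_accesses=500, cache_misses=20)
end = CounterSample(cycles=9_000, cache_accesses=2_500, cache_misses=420)
print(end.cycles - start.cycles)    # cycle count between the two reads
print(miss_ratio(start, end), is_excessive(start, end))
```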


Reducing Unnecessary Thread 
Migration 

To achieve optimal performance in 
a multicore system, some operating sys- 
tems support soft processor affinity: the 
OS scheduler will always try to dispatch 
a thread to the core where the thread last 
ran. That way, the core can often fetch the 
thread’s instructions directly from the L1
cache, rather than having to reload them
from the L2 cache or main memory. In
some cases, however, threads will drift
from core to core, overwriting each other’s
cached instructions and forcing the L1
cache to be continuously reloaded. Each 
time a thread is rescheduled to run on an- 
other core, the performance advantage of 
using information in the L1 cache is lost. 

Unnecessary thread migration can re- 
sult from higher-priority threads interrupt- 
ing lower-priority threads. Each time such 
an interrupt occurs, there is the possibility 
that the OS will schedule the interrupt- 
handling thread on another core. The more 
cores on the chip, the higher the probability 
is that this will occur. By providing perfor- 
mance counter information, a system-trac- 
ing tool can make it much easier to diag- 
nose this condition. For instance, the pro- 
filer can provide statistics on the number 
of core-to-core migrations for a particular 
thread (Figure 4). By displaying these sta- 
tistics graphically, the profiler can provide 
a system-wide perspective, making it easier 
to zoom in on potential problem areas.
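
The migration statistic mentioned above can be approximated from a trace log. The sketch below assumes a simplified, hypothetical trace format of `(timestamp, thread_id, core)` tuples and counts how often each thread is scheduled on a different core from the one it last ran on.

```python
from collections import Counter


def count_migrations(events):
    """Count core-to-core migrations per thread.

    `events` is an iterable of (timestamp, thread_id, core) tuples, a
    hypothetical simplification of a system-trace log.
    """
    last_core = {}
    migrations = Counter()
    for _, thread_id, core in sorted(events, key=lambda event: event[0]):
        previous = last_core.get(thread_id)
        if previous is not None and previous != core:
            migrations[thread_id] += 1
        last_core[thread_id] = core
    return migrations


trace = [
    (0, "worker", 0), (1, "logger", 1), (2, "worker", 1),
    (3, "worker", 1), (4, "worker", 0), (5, "logger", 1),
]
print(count_migrations(trace).most_common())
```

Sorting the per-thread counts, as `most_common()` does here, puts the threads that migrate most often at the top of the list.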


A Comprehensive Approach 
While the right tools can greatly sim- 
plify the task of troubleshooting and op- 
timizing a multicore design, they cannot 
work in isolation. An OS, if designed cor- 
rectly, can also help reduce the complexity 
of deploying software on a multicore chip. 


This is especially true if the OS can trans- 
parently manage the allocation of shared 
hardware resources in a multicore chip. 
Moreover, an OS designed for multicore 
processing can provide the flexibility to 
implement the best optimization strategy 
that the tools suggest. The QNX Neutrino 
RTOS, for example, supports bound multi- 
processing. This is an execution model that 
offers the transparent resource management 
of traditional SMP, but that also allows the 
locking of any process to a specific core. 
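
The QNX interface for binding a process to a core is not shown in this article. As a loosely analogous sketch, the Python example below uses `os.sched_setaffinity`, which the standard library exposes only on some platforms such as Linux, to restrict the calling process to a single core. The helper name `pin_to_core` is hypothetical.

```python
import os


def pin_to_core(core, pid=0):
    """Restrict a process (0 = the caller) to one core, if the platform allows.

    Returns the resulting set of allowed cores, or None where the
    affinity calls are unavailable.
    """
    if not hasattr(os, "sched_setaffinity"):
        return None
    os.sched_setaffinity(pid, {core})
    return os.sched_getaffinity(pid)


if hasattr(os, "sched_getaffinity"):
    allowed = os.sched_getaffinity(0)   # cores this process may currently use
    target = min(allowed)               # pick one that is certainly permitted
    print(pin_to_core(target))
    os.sched_setaffinity(0, allowed)    # restore the original mask
```

Pinning trades scheduling flexibility for cache locality, so it is usually reserved for the processes that tracing shows are migrating to their cost.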
Mercury Computer Systems’
James “Jay” R. Bertelli 


RTC: One of the first questions that 
comes to mind—and that Mercury has 
been prominently in the news about—is 
about Cell computers and Mercury’s 
relationship with IBM. IBM’s Cell
computer made a big splash in the media
primarily as an engine for high-powered
games. However, Mercury
has brought the Cell processor into the 
embedded-computer industry. Can you 
tell our readers what your plans are for 
the Cell processor and what applica- 
tions are likely to gain the most from 
its application? 


Bertelli: First of all, I want to thank RTC 
magazine for the opportunity to address 
your readership. We, at Mercury, are very 
excited about the IBM Cell Broadband 
Engine (BE) processor. The customer 
response to the Cell processor has been 
tremendous. Leveraging Mercury’s new 
Cell-based products and services, our 
customers tell us that they expect to make 
a real difference in health care, interna- 
tional security, oil and gas exploration 
and other application areas. 

The Cell processor is the most sig- 
nificant architectural advance in high- 
performance embedded computing since 
Apple, IBM and Motorola (now Freescale) 
introduced AltiVec-enabled processors 
over a decade ago. At that time, AltiVec 
leapfrogged competitive offerings and be- 
came the dominant architecture in signal 
and image processing. We see the Cell 
architecture as more than revolutionary; 
it represents a shift in the tectonic forces 
acting upon embedded computing. 

For many years, Apple’s multimedia 
applications drove the high-end processor 
roadmaps of Freescale and IBM. Processors 
that had been targeted at this larger space 
were then retargeted at embedded computing.
This symbiotic relationship was stable
for several years, with Freescale
AltiVec and IBM VMX commanding
performance and market
share leadership over alternatives.

The IBM Cell BE processor 
enters the embedded scene at a 
time when the multimedia desk- 
top market has yielded to graph- 
ics and gaming as a major driving 
force in high-end embedded pro- 
cessor innovation. So when IBM 
came to us, we realized that we were wit- 
nessing another industry milestone. When 
combined with Mercury’s expertise with 
multi-computer enablement and optimiza- 
tion, this new processor could accelerate 
numerically intensive applications by as 
much as two orders of magnitude. 

In order to turn a processor’s potential 
into reality, it takes more than just silicon. 
Developers need a robust development 
environment and an array of related tech- 
nologies and support. We call this Mer- 
cury’s MultiCore Plus Advantage—this 
includes our optimized libraries, software 
tools, middleware and algorithm tuning 
expertise. We also lead the industry in 
creating solutions that can be deployed in 
the world’s harshest environments. 

We signed an agreement with IBM 
to harness the power of the Cell BE pro- 
cessor for embedded designers in both 
board- and system-level products. Today 
we are pleased to report that by the time 
this article reaches readers, our first customers
will have taken delivery of initial
Cell development systems for their work
on existing commercial, OEM and
defense applications.

But I can tell you that what is really 
the most exciting facet to me personally 
is that Cell opens up new possibilities 
for OEMs and defense contractors to put 
hundreds of GFLOPS in small, lightweight
and low-power configurations, potentially
resulting in brand new applications
that were previously not thought
possible. We are receiving interest in Cell 
hardware, software and related services 
from a wide variety of customers in semi- 
conductor inspection, complex vehicle 
navigation, video applications, digital 
media, biotechnology and even gaming. 

In October 2005, Mercury an- 
nounced our first product, the Dual Cell- 
Based Blade, which is based on Cell 
technology and the IBM industry-lead- 
ing BladeCenter standard. This product 
offers 400 GFLOPS on a single blade or 
16 TFLOPS in a 6-foot rack. The Linux 
and Eclipse-based Cell blade is targeted 
at raised-floor or benign embedded envi- 
ronments, and it also makes a great devel- 
opment system for ruggedized products, 
which we will announce shortly. In November,
we announced our second Cell-based
product, code-named “Turismo,”
that will pack 800 GFLOPS into a
600-cubic-inch footprint. Four Turismo boxes
in a 5U configuration are expected to 
yield a peak performance of 3.2 single- 
precision TeraFLOPS and more than 25 
TeraFLOPS in a 6-foot rack. We plan to 
announce additional Cell-based products 
in the next several weeks.

RTC: In a recent article in the Washington
Post entitled “A Nervous Eye
on Defense Firms” the following para- 
graph appeared: 

...some analysts feel the slump in 
investor enthusiasm could soon trickle 
down to the usually buoyant stocks of 
the industry’s small to mid-size play- 
ers. Many of the firms specializing in 
information technology and electronics 
continue to have “premium valuations”
that are “unwarranted,” Joseph B. 
Nadol, aerospace and defense analyst 
for JP Morgan, said in a recent note to 
clients. “As we believe defense growth 
will slow to near-zero in 2007, we do not 
believe these companies will sustain the 
double-digit top line growth that their 
managements and investors are target- 
ing,” the report said. 

As a large component of Mercury’s 
revenue is derived from military and 
aerospace industries, do you believe 1) 
that the above is true? And 2) is Mer- 
cury moving more aggressively into 
other arenas such as industrial control, 
medical instrumentation and commu- 
nications to lessen its dependency on 
the military market? 


Bertelli: To help your readers understand 
how those observations impact Mercury, 
I need to explain three things. First, we 
have been selling in commercial markets 
from Mercury’s earliest days—in fact, our 
first customer more than 20 years ago was 
a semiconductor wafer-stepping equip- 
ment manufacturer. 

Second, leveraging our technology 
investments across multiple market seg- 
ments has been part of our strategy for a 
long time. For example, our amira software 
enables users to render complex 3-dimen- 
sional data and has applications in nearly 
every market that we serve. The cross 
fertilization of our business units helps us 
to maintain competitiveness and benefits 
our customers by providing them with a 
broader and more dynamic solution base. 

Lastly, I believe broad generalizations 
like the one from JPMorgan don’t provide 
the whole story. There are many sub-seg- 
ments within the aerospace and defense 
electronics market. While some are ma- 
ture, others promise dynamic growth. We 
are getting traction with a solution we call 
ARIES (Airborne Reconnaissance Image
Exploitation System), which enables multicomputers
to execute image exploitation
algorithms closer to the sensors in UAVs. 
This has implications in delivering bet- 
ter real-time data to warfighters. Mercury 
seeks and evaluates attractive opportuni- 
ties on an ongoing basis and pursues those 
in which we can add significant value—in 
some cases regardless of what is happen- 
ing at the overall industry level. 

So, to summarize, we will continue 
evaluating current and new market oppor- 
tunities in both commercial and defense 
markets with only modest regard for com- 
mentary like the one you cited. Our focus 
is much more on understanding how we 
can add customer value across markets. 


RTC: The ATCA specification has raised 
a lot of interest recently. Venture Devel- 
opment Corp. was upbeat about its pros- 
pects in a recent report indicating the 
market could reach $8 billion by 2009, 
but commented that the bulk of the ac- 
tivity will be in captive manufacturing 
by end users and by contract manufac- 
turers. Do you believe the ATCA market 
could grow to such dimension over the 
next few years? And, do you think the 
merchant board and subsystem mar- 
ket—as opposed to captive and contract 
manufacturing—will be viable? 


Bertelli: As you know, Mercury ed- 
ited the PICMG 3.5 AdvancedTCA and 
AMC.4 standards. Mercury has been a 
supporter and driver of PICMG, VITA 
and other industry standards for many 
years, and many of our telecommunica- 
tions customers are showing significant 
interest in AdvancedTCA. We receive a 
wide range of projections from industry 
analysts on ATCA and AMC, but I can 
say this: AdvancedTCA does not need to
become an $8 billion market in order for it 
to bring tremendous value to our custom- 
ers. Every major communications silicon 
vendor is offering its newest silicon solu- 
tions on the AdvancedTCA form-factor, 
and that makes ATCA a very convenient 
and low-risk choice for evaluating new 
technology and architectures. 

In regard to your other point, it is 
key to remember that there are profit- 
able niches for small- and medium-sized 
firms if they are properly differentiated. 
For example, our Echotek product group
has a world-class facility for manufacturing
analog and mixed-signal boards
with very low signal noise. That type of 
expertise is a competitive advantage for 
us. Furthermore, there is no inherent con- 
flict between contract manufacturers and 
merchant board vendors who are in the 
business of doing highly optimized board 
design and multicomputer integration. In 
fact, like the system OEMs and contrac- 
tors, it seems to me that the major mer- 
chant board manufacturers are currently 
benefiting from outsourcing trends. 

In fact, after the restructuring of the last 
few years, the Telecom Equipment Manu- 
facturer (TEM) development organizations 
have been pared back while competition has 
grown fierce. So in our case, TEMs are com- 
ing to us for expertise in piecing together so- 
phisticated system-level solutions. 

In response, Mercury put together 
the Ensemble AdvancedTCA system with 
serial RapidIO switch infrastructure, pro- 
cessor AMCs, open-standard IP where 
needed and a complete software environ- 
ment. This program required us to engage 
with silicon and module vendors to ensure 
that everything would come together. We 
managed third-party supplier schedule 
slips, formed contingency plans and pre- 
sented our customers with a fully inte- 
grated development platform. We created 
all of the software, middleware
and stacks necessary to generate next-generation
telecom applications. Our OEM
customers were overjoyed. They were 
used to doing all the integration them- 
selves, and it diverted their focus away 
from their own special sauce. 


RTC: Many have been critical of the Ad- 
vancedMC and MicroTCA specifications 
as being inadequate for any but the most 
benign communications applications. 
In addition, they have been criticized for
other limitations such as the maximum 
number of layers, frailty of the connec- 
tor and lack of power management. Do 
you believe such perceived limitations 
will limit the useful applications for the 
spec? Do you think the recently started 
VITA 56 committee’s efforts will result 
in a specification that will compete with 
or co-exist with AMC? 


Bertelli: Mercury invests in core technologies
like processors, fabrics and software
infrastructure. These capabilities transcend
module form-factor so we don’t lose sleep 
over bus wars or processor wars or form- 
factor wars. We have a treasury of intel- 
lectual property and applications expertise 
that our customers ask us to apply to either 
a narrow or broad area of focus. Our cus- 
tomers work with us because we help them 
solve their problems. If their problem state- 
ment is phrased around a certain form-fac- 
tor, we can easily accommodate that. When 
we produce a system-level product, we 
choose the technologies that are the best fit 
for target markets. 

It should come as no surprise that 
VITA standards are stronger in defense and 
PICMG standards are stronger in telecom- 
munications and image processing. Medical 
imaging, on the other hand, employs indus- 
try standard architectures like blades. In 
general, these system and module form-fac- 
tors are applied very differently and do not 
compete with one another. Unlike some of 
the other industry players, Mercury is fortu- 
nate to have the R&D scale and breadth to 
support the numerous standards that are crit- 
ical to our customers. So which form-factor 
does Mercury choose to support? Whichever 
form-factor is best for the job at hand. This is 
why you see products coming from Mercury 
in ATCA, CompactPCI, PCI, VXS (VITA 
41), VPX (VITA 46), VPX-REDI (VITA 
48), IBM BladeCenter and others. 

As long as we are talking about R&D 
strategy, I should point out that in some ar- 
eas, Mercury has expanded well beyond the 
hardware platform. For example, we offer a 
visualization software solution called Vis- 
ageRT that provides embedded components 
for accelerated reconstruction of images 
in medical, life sciences and biotechnol- 
ogy. VisageRT does bring optional GPU or 
FPGA numerical acceleration to commod- 
ity hardware platforms, but there is even 
more value-add in our highly optimized 
software tools and algorithms that integrate 
seamlessly into the existing frameworks of
medical OEMs and ISVs. Additionally, we
offer professional services to help custom- 
ers minimize their development time and 
cost. We believe Mercury leads in our abil- 
ity to service our customer requirements at 
virtually any level of the value chain. 


RTC: Mercury has been a developer 
and strong supporter of RapidIO since 
its inception. Has RapidIO met Mercu- 
ry’s expectations both technically and 
in the marketplace? Do you perceive 
that PCI Express or AS is directly
competitive with RapidIO or comple- 
mentary? Why? 


Bertelli: Mercury co-invented RapidIO 
with Freescale (formerly Motorola Semi- 
conductor) back in the late 1990s and later 
co-founded the RapidIO Trade Association. 
Originally, we conceived of RapidIO as a 
logical evolutionary step and heir to our 
RACEway Interlink. In our collaboration 
with Freescale, we worked to address the 
broadest set of market requirements pos- 
sible. The RapidIO Trade Association now 
has about 50 member companies and virtu- 
ally all of them are actively deploying, de- 
veloping or creating products with RapidIO. 
At this point, Mercury has RapidIO sys- 
tem solutions available in AdvancedTCA, 
CompactPCI, VME VXS (VITA 41) and 
Multiport form-factors. Other vendors have 
announced VME64x, VXS and VPX point 
solutions using RapidIO. Mercury RapidIO
systems are found in the Global Hawk
UAV, the Aegis Cruiser, Micronic inspection
equipment and some of the other
most rugged and demanding applications
in the embedded landscape. No other fab- 
ric can credibly make claims of this mag- 
nitude. People have always acknowledged 
that RapidIO is better technology. Now its 
ecosystem and customer adoption are also 
rapidly moving forward. 

However, embedded computing is 
not ever likely to entirely consolidate
around a single technology, and that statement
applies to fabrics as well as processors,
form-factors, languages, middleware
APIs and operating systems. Every fabric 
has its strengths and appropriate target 
markets. Again, Mercury is fortunate to 
have the scale and R&D capacity neces- 
sary to deploy every major interconnect 
and fabric across our diversified product 
lines. This makes us uniquely qualified to 
help our customers choose the best fabric 
for their particular needs. 

InfiniBand still enjoys strength in su- 
percomputing and cluster environments 
like storage and blade servers. In fact, 
InfiniBand proved to be the ideal choice 
in our BladeCenter-compatible Dual Cell- 
Based Blade product. Ethernet is also an 
important part of our product line. Our 
Momentum Series VPA-200 single board 
computer supports Ethernet to the back- 
plane as a means to connect to adjacent 
gigabit class I/O devices. Ethernet is also 
a great generic box-to-box interconnect, 
aside from being the only reasonable 
choice for wide area networking. 

Advanced Switching has been slow to 
develop an ecosystem, but we are tracking 
it. ASI’s main value proposition appears 
to be centered on its ability to tunnel PCI 
traffic, but that problem has many other 
more mature solutions. For example, Mer- 
cury supports the encapsulation of PCI 
Mezzanine Card (PMC) traffic on our 
PowerStream 6100 VME VXS multicom- 
puter, which uses a serial RapidIO back- 
plane fabric. 

PCI Express, on the other hand, is quite 
different from ASI. PCI Express promises 
to build upon PCI’s success in serving as 
the bread and butter interconnect for local 
I/O and small clusters of processors. When 
a problem grows to the point where a sim- 
ple interconnect is not enough, the system 
designer has to introduce backplane fabric 
capability. PCI Express is architecturally 
identical to PCI. Mercury has successfully
deployed RapidIO to bridge PCI clusters
so we see RapidIO and PCI Express simi- 
larly working hand in hand. 


RTC: There seems to be a trend in the 
embedded-computer industry to at- 
tempt to supply more and more com- 
plete systems to OEMs, thus climbing 
up the food chain in hopes of increasing 
revenue with the additional parts of the 
systems supplied. Many companies have 
done this through acquisitions that al- 
low the company to provide additional 
parts of systems. Do you believe the in- 
dustry will continue to consolidate in 
this way? At what point do you think 
embedded-computer suppliers will be- 
gin competing with their customers? 
Will a time ever come when there are 
only a small handful of system mak- 
ers and the traditional merchant board 
market will go away? 


Bertelli: There was a time when it was 
considered forward thinking to bundle an 
operating system and development tools 
with a merchant processor board. Today, 
customers take these things for granted 
and ask us to add value in new ways. The 
key is to focus on solutions that enable our 
customers to achieve their goals better and 
faster than they did in the past. Mercury 
has been designing at the system level for 
many years, and I can tell you that we are 
being asked to do more and more by many 
of our customers. 

Having said that, customers now re- 
quest a wide variety of products and ser- 
vices. In one case, they may ask us to pro- 
vide a single module or to license some 
of our IP for one of their own modules. 
In other cases, they seek an integrated set 
of boards or assistance in the application 
space. There are even cases where custom- 
ers ask us to take their IP, integrate it into a 
computer system that we build, and ship it 
all back to them. The business models that 
customers propose usually derive from 
their own unique product-line strategy. 

Yet, system solutions are not the only 
formula for growth. Our Momentum Series 
single board computers are excellent examples
of how innovation can trump commoditization.
Designing a 3U CompactPCI
board with Pentium M or PowerPC processors
is considered to be straightforward.
But Mercury has the design processes perfected
to put a customized version of that
board in your receiving department in 12 
weeks instead of 37 weeks. That opens up 
a whole new set of possibilities for telecom 
OEMs who are under tremendous time- 
to-market pressure. Our Momentum Se- 
ries product line has an advantage that is 
the result of a disruptive business model. 
The traditional board market can survive 
if vendors innovate and maintain a value- 
add. This is accomplished by working with 
customers, not against them. 

We read of board suppliers in the defense
electronics market proclaiming their
intention to become third-tier contractors. 
This is 180 degrees counter to Mercury’s 
strategy. We do not sell defense solutions 
directly to the U.S. government. We stay 
clearly focused on providing products and 
services that accelerate our customers’ 
time-to-market and help them pursue their 
own differentiated value propositions. 

Mercury’s acquisition strategy in 
defense speaks to our commitment to 
continually strive to better serve our 
customers. Our Echotek Series (radio fre- 
quency and mixed signal), Momentum Se- 
ries single board computers and Mercury 
DSP modules all work together in our new- 
est PowerStream 6100 multicomputer. In 
the future, we envision functionality from 
these modules sitting together on a single 
card. The synergy and benefit to customers
is quite clear. Unlike acquisition strategies
that are focused on bulking up revenue,
this is a technology-driven strategy to
provide our customers with products that 
enable them to do more. 


RTC: Other than the military mar- 
ketplace, Mercury has been a major 
player in the medical imaging business. 
And, according to reports in the press, 
CT scanning equipment as well as PET 
scans and even E-Beam tomography 
are getting better and better. What 
parts of the imaging technology are 
improving because of improvements in 
computers? Could you tell our readers 
a little about the special requirements 
for technology such as real-time, three- 
dimensional scans? 


Bertelli: Improvements in computational 
capability will sharpen image quality in 
clinical applications, which will lead to 
better, more accurate patient diagnoses. 


New computing capabilities in visualization 
and graphics processing will also improve 
clinical applications and optimize hospital 
workflow, reducing cost and minimizing 
patient delays. For example, today we can 
offer a fully scalable thin client server that 
places any clinical application in 2D, 3D or 
4D at the point of care in any PC or laptop
in the health enterprise. 

From a technical point of view, there 
are two main factors driving computational 
complexity in medical image reconstruc- 
tion: de-noising of the input data with en- 
hancement of edges, and reconstruction of 
3D or even 4D volumes from 2D views. 
Industry-standard architectures like the PC 
fall far short of the computational power 
required. 4D volumes are even more chal- 
lenging, because they include 3D images 
with the added dimension of time, such as 
the 3D image of a beating heart. 

The new IBM Cell BE processor boards 
from Mercury can perform a 3D volume 
reconstruction in just a few seconds versus 
about 5 minutes on a conventional proces- 
sor. We show this side-by-side comparison 
in a short video available on our Web site 
(www.mce.com/cell/demo.cfm). Cell tech- 
nology promises to significantly speed the 
reconstruction process, which has tradi- 
tionally been viewed as the bottleneck in 
the diagnostic workflow. Advances like 
these can improve the bottom line of the 
health enterprise by enabling more patients 
per day per scanner. 


RTC: Embedded-computer technology 
has continued along a growth curve fol- 
lowing Gordon Moore’s law. Advances 
such as the Cell processor continue to add 
speed and density. And all along clever 
designers figure out things to do with 
the additional computer horsepower. 
Provided the gods of physics don’t get in 
the way and compute power continues to 
grow, what applications do you envision 
say, a decade away? Twenty years? 


Bertelli: The industry is rich with process- 
ing solutions and not all of them are proces- 
sors. Many of our Echotek Series boards 
feature the Xilinx Virtex-4 FPGA, which 
can replace 250 general-purpose processors 
for certain algorithms. The industry is also 
very creative in finding ways to meet the 
needs of computationally intensive applications.
The Cell BE processor is a significant
development, because it demonstrates that
new architectures can deliver significant 
increases in performance without relying 
solely on brute force MHz increases and 
simple increases in gate density. 

Different applications will find differ- 
ent answers to the Moore’s Law challenge. 
We find pockets of strength for Freescale 
PowerPCs, Texas Instruments DSPs, 970, 
Pentium M, NVIDIA GPUs and, of course, 
Cell BE processors. Each has appropri- 
ate use for certain applications, depend- 
ing on the type of operations required (bit 
operations, scalar, single- or double-precision
floating point) or the unique application
constraints (cost, power, size, weight).
With a range of processor offerings, it’s 
easier to meet demanding requirements and 
guide customers to the best overall choice. 
Our experience in development tools and 
middleware APIs ensures that our custom- 
ers will develop applications productively 
even as processor architectures increase in 
complexity. Our customers don’t necessar- 
ily know what processors will be available 
in 20 years, but they can be assured that 
Mercury will offer them the best possible 
processing choices with the appropriate development
tools in any given generation
of technology. 

The future is difficult to predict, but 
we do see some interesting commonalities 
in the various markets that Mercury serves. 
One example is that sensor and scanner 
technologies are improving quickly and 
creating great challenges, particularly in 
markets where human analysis is still criti- 
cal. In medical imaging, technologies are 
delivering more pixels per scan and more 
scans per second. These trends, combined 
with an aging population, will put tre- 
mendous pressure on back-end analysis in 
radiology. Faster processing and better al- 
gorithms will lead to more timely and ac- 
curate diagnoses for patients. 

In the military market, airborne sen- 
sors are evolving to collect higher resolu- 
tion imagery at higher rates, but 90% of 
the data collected by UAVs is unexploited 
due to limited radio frequency transmis- 
sion bandwidth. The solution is to co-lo- 
cate image exploitation algorithms with 
the sensor on the UAV, but this requires 
enormous processing capability. 

In both these cases, we see a data 
glut. Increasing the processing capacity
and density will help alleviate the problem
by screening, organizing and rendering
the data in ways that allow the human
analyst to work more effectively. 

We also expect to see interesting uses 
of data navigation in embedded applica- 
tions. This would enable a radiologist to 
visually browse a 3D model of a CT scan 
volume looking at blood vessels, bones, 
organs and fractures from different an- 
gles. In the military, sensor data could be 
combined together to produce panoramic 
3D simulations, allowing pilots to train 
for missions using actual terrain data that 
is updated in real time. These and other 
emerging applications involve highly so- 
phisticated algorithms that require enor- 
mous processing power. 

The excitement over the Cell BE pro- 
cessor reinforces my conviction that tech- 
nologists will never cease to push the en- 
velope with new ideas, new ways of solv- 
ing problems and new algorithms. Many 
of these are simply awaiting the necessary 
processing power to be available on a
cost-effective, deployable platform.

Integrating Model-Driven
Development with IDE 
Breaks Productivity Barriers 


Model-driven development integrated with editor, compiler, debugger, 
RTOS and middleware manages complexity, lowers project risk and speeds 
time-to-market for complex software projects. 


by George LeBlanc 
I-Logix


Today’s software projects continue to increase in complexity
while the demands grow for completion and deployment
in shorter development cycles. With this trend for intricate
software, developers and architects call for technology solutions
that can support, clarify and allow complex projects to successfully
meet the requirements and goals of a given design. While
the “wall” that once separated the systems and software sides of
development has in many organizations been dismantled, optimizing
the process into one cohesive solution has not, until now,
been fully addressed. In recent months, however, a redefinition,
or some say a revolution, has occurred in the integrated development
environment (IDE), resulting in significant usability and
productivity gains, finally offering a better way to tie together
essential system and software development functions in one
seamless tool environment.

This revolution has occurred by integrating design in the
form of model-driven development (MDD), the more familiar
integrated development environment (IDE) and deployment
(RTOS and middleware) into one seamless, bi-directional workflow
environment, so that systems and software engineers have a
powerful, flexible tool that includes all phases of development in
one tightly integrated tool chain. In redefining the IDE,
model-driven development needs are covered by a graphical development
and validation environment, offering the latest version of
the industry standard Unified Modeling Language (UML) and
the soon to be approved Systems Modeling Language (SysML);
IDE needs are represented by a set of compiler, debugger and
optimization tools; deployment needs are met by Green Hills’
real-time operating system, a microkernel and middleware. The
first implementation of such a fully integrated environment is the
result of a cooperative agreement between I-Logix and Green
Hills Software.

The powerful benefits contained within the integration make 
designing and developing in the redefined IDE compelling to 
software architects and designers. Rather than using a traditional 
“waterfall” approach where separate processes are used for dif- 
ferent aspects of development, for example separating require- 
ments analysis through design from the implementation through 
deployment process, now a tight, iterative model-based approach 
is used to rapidly construct and respond to requirements changes 
throughout the development process (Figure 1). 

The unified MDD/IDE solution generates the implemen- 
tation in C, C++ and Ada source code directly from the UML 
model, and the model is automatically updated to reflect changes 
made to the code, ensuring that the final product meets design 
objectives by providing traceability between the design, source 
code and the requirements. Additionally, this integration short- 
ens the debug cycle and improves product quality by offering 
powerful visual debugging and analysis capabilities to users. 


Working in One Integrated Environment 

The integration of the MDD environment, IDE and RTOS
results in a solution that addresses all phases of embedded systems
development. Requirements analysis, system architecture,
software design and application development are completed using
a combination of SysML and UML. Application behavior
can be validated on the host platform as the application is being
constructed through simulation provided by the MDD execution
environment. The MDD environment can then generate production
quality C, C++ and Ada source code automatically from the
UML models, which can be fed into the IDE’s C/C++ and Ada
compilers, and the object code then transferred to the target.

Workflow showing MDD and IDE working together: As designs are constructed from within the MDD environment of
Rhapsody, code can be automatically generated or linked into existing code at any point in the process. Rhapsody
automatically invokes the necessary Green Hills tools in the chain to build the code into a complete application. This
application can run on the host or in the target with bi-directional synchronization and debugging.

The developer can take programs running on the target, set 
breakpoints on the graphics in the modeling environment and 
have the program stop at the same point in the IDE’s source-level 
debugger. Working from the other direction, a developer can load 
and debug the code in the IDE environment and set breakpoints 
as well. The modeling environment will highlight the graphical 
diagrams that correspond with the breakpoints, allowing a seam- 
less, bi-directional workflow in a clearly understood environment 
that is most natural to the user’s development process. The result- 
ing debugged application is then downloaded and deployed on 
the target through the IDE. Now the models from which the code 
was generated are linked to the executable code enabling run- 
time analysis and debug. Lastly, any changes made in the model 
or code are automatically synchronized, ensuring that the design 
and the code are always in step with each other. 


Boosting Quality and Meeting Deadlines 

This integration makes testing the application as it is devel- 
oped a fundamental capability that provides tremendous quality 
and time-to-market benefits. Such an environment gives the de- 
veloper the ability to eliminate design flaws as they appear, when 
they are cheaper to fix, long before the embedded hardware is 
available. In addition to simulating the application on a development
host system, this integration enables developers to leverage
the target simulator provided by the IDE to test the application as
it is being developed. Also, the modeling tool can draw on the use
cases and scenarios as specified during requirements analysis, or
captured during run-time, as a test harness for the application.

[Figure axis label: Effort / time to release quality product]

The modeling environment itself can automatically generate 
test vectors that can be executed on the target application, saving 
the developer tremendous amounts of work in manually creat- 
ing the testing scenarios. For deployment, the integrated solution 
generates all source code, configuration files and build files nec- 
essary to synthesize the generated code and any manually written 
code into a completely deployable application targeted for a spe- 
cific real-time operating system (RTOS) running on the target. 

Typical real-time embedded development environments are 
fraught with gaps in process and in the tool chain that make it 
extremely painful and costly to efficiently develop high-quality 
applications. The typical development process often starts with 
a “rush to code,” where an organization receives a work order 
and then leaps to start coding before completely analyzing the 
requirements and laying out the system and software architec- 
tures. The result is that the code often takes a completely differ-
ent path from the requirements and the design. Moreover, there is
typically no effective way to deal with requirements changes as 
they occur or measure their impact. Tight integration removes all 
of these concerns by enabling a single, self-documenting, MDD 
and IDE environment that ensures that the design and code are 
synchronized throughout the process. With this approach the 
impact of requirements changes can be easily analyzed prior to 
implementation, and any changes made to the implementation 
are validated and tested against the requirements, ensuring a 
closed-loop process (Figure 2). 





Complex applications and the need to balance code quality with time-to-market pressures require new solutions. To 
address these pressures, trends reveal moving away from completely hand-written applications to applications that use 
commercial RTOSs, middleware and automatically generated code in one tightly integrated environment. 

Managing Complexity

One key reason why MDD/IDE integration is so powerful 
is its ability to manage complexity. The MDD/IDE environment 
allows real-time embedded system and software developers to 
analyze complex problems by breaking them down into small, 
manageable units. It gives developers the ability to look at prob- 
lems from various perspectives and abstract away non-essential 
details in order to focus on the task at hand. With this integra- 
tion, developers can then accomplish manageable analysis and 
development activities and build the system from its component 
parts. Furthermore, this integration enables a tight coupling from 
requirements to design to code, ensuring that the system under 
design addresses the specific requirements as called out and 
managed in the respective requirements traceability tool. These 
requirements are not only an integral part of the process, but also 
an integral part of the model as well. Graphical modeling and 
analysis, traceability, execution, validation and visualization are 
all key contributors to effectiveness in dealing with complexity. 

When trying to debug complex problems, the integrated tool 
chain again simplifies these problems by offering a developer the 
ability to look at the specific issue at the right level of abstrac- 
tion, either at the model level or the source-code level. Real-time 
embedded developers need the right tools for the job, and debug- 
ging complex applications is greatly facilitated by having visibil- 
ity to both the UML model and the source code simultaneously, 
and being able to make changes to either representation and have 
the other representation automatically updated. This integration 
provides the needed flexibility and visibility into the run-time 
environment, making it one of the most powerful mechanisms 
for analyzing and debugging complex problems. 


Enhancing Productivity 

MDD/IDE integration supports faster, seamless workflows 
within one tool environment. This revolutionary integration is fo- 
cused on improving system and software developer productivity. 
This is accomplished by providing a bi-directional workflow that 
increases visibility into the application, both during the design 
and development phases as well as during test and debug and 
deployment. The model serves as the foundation for all develop- 
ment. This level of abstraction facilitates not just the many tasks 
performed throughout the project’s lifecycle; it also enhances 
communication across the team. Models simplify the process 
while at the same time increasing the speed and effectiveness 
with which a company or team can design a system, nimbly re- 
sponding to requirements changes, generating new implementa- 
tions and testing the generated application as they go. The trend 
is clearly to move away from hand coding to higher levels of ab- 
straction while maintaining close linkage and control over the 
code down to the final compiled and deployed result (Figure 3). 

One of the best ways to boost productivity is to reuse source 
code or designs that have already been proven in the field. The 
MDD/IDE approach gives developers the ability to do both: reuse
previously constructed UML models or reuse legacy
code (C, C++ or Ada).

With the push for complex software in ever shorter develop- 
ment cycles showing no signs of stopping, software developers 
demand technology solutions that will give them the necessary
tools for the challenging task at hand. In the past, software archi- 
tects and developers relied on an error-prone waterfall workflow 
that did not address the reality of how systems are constructed, 
and have struggled to find a better way to manage these chal- 
lenges. For today’s development needs, the solution is to optimize 
the iterative development process, improve communication and 
provide a bi-directional workflow between the design and the 
implementation. Developers must be able to visualize systems 
at the appropriate level of abstraction to deal with complexity 
and manage requirements changes that occur. The solution lies
within the latest revolutionary step to redefine the IDE to include
the MDD, RTOS and middleware tools in one tightly integrated
environment.


I-Logix

Andover, MA. 
(978) 682-2100. 
[www.ilogix.com]. 







Solid-State Disk Targets Industrial Data Storage


Data storage media used in industrial environments must possess 
fast startup times so that associated industrial equipment will come on- 
line as quickly as possible. Until recently, high-performance industrial 
solid-state storage disks have been considerably pricier than alternative 
media with similar performance. To help lower the barrier to their use, 
Adtron has introduced the I25F Entry-Point Flashpak IDE solid-state 
flash disk family. 


The I25F family, like other Adtron Flashpak solid-state flash disks, is
based on single-level cell (SLC) NAND flash technology. With a capacity
range from 256 Mbytes to 8 Gbytes, the I25F flash disk family supports
standard IDE transfer modes, PIO 0-4 and MultiWord DMA 0-2. Options
include either commercial temperatures (0° to 70°C) or industrial
temperatures (-40° to +85°C). Single unit pricing for a 4 Gbyte I25F flash
drive with a commercial temperature rating is $546.


Adtron, Phoenix, AZ. (602) 735-0300. [www.adtron.com]. 





Managed 24-Port Gigabit Ethernet Switch 
Supports IPv6 


One key requirement of future programs within the Department 
of Defense’s Global Information Grid is the support of IPv6 for net- 
work-centric warfare. A 24-port Gigabit Ethernet switch from Radstone 
Embedded Computing that provides fully managed Layer 2/3 switching 
also supports IPv6. 


With a non-blocking shared memory architecture, the CPX24
features 24 standard copper GbE ports. Twenty of the ports are
PICMG 2.16-compliant, and the other four are available through an
optional high-speed J4/J41 connector. All 24 ports can be converted to
fiber via a mix of onboard optics and Radstone’s OXB20 Optical Expansion
Board, allowing operation over long distances or in electrically
noisy environments. Two externally available 10 GbE ports allow two
CPX24s to be coupled as a 48-port GbE switch.


Additional flexibility can be provided by bringing out the two 10 GbE
ports to a separate, optional 10 GbE expansion board for very fast
subsystem I/O. IPv6 support delivers expanded available IP address space
(128-bit addressing) and improved end-to-end security, facilitating mobile
communications, enhancing quality of service (QoS) and easing system
management. The CPX24 can be ordered in any of five ruggedization
levels. Price for a single, level-one unit is $7,380.
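
To put the 128-bit addressing figure in perspective, the following is a minimal Python sketch using only the standard library. The prefix shown is the range reserved for documentation and is an illustrative assumption, not a CPX24 setting.

```python
import ipaddress

# Address counts implied by the address width alone.
IPV4_TOTAL = 2 ** 32
IPV6_TOTAL = 2 ** 128

# 2001:db8::/32 is reserved for documentation; this /64 is only an example.
EXAMPLE_NET = ipaddress.ip_network("2001:db8::/64")


def describe():
    """Return a few figures comparing IPv4 and IPv6 address space."""
    return {
        "ipv4_total": IPV4_TOTAL,
        "ipv6_total": IPV6_TOTAL,
        "addresses_in_one_slash64": EXAMPLE_NET.num_addresses,
        "ipv6_to_ipv4_ratio": IPV6_TOTAL // IPV4_TOTAL,
    }


if __name__ == "__main__":
    for key, value in describe().items():
        print(f"{key}: {value}")
```

A single /64 subnet alone holds 2^64 addresses, which is why address exhaustion stops being a planning constraint on an IPv6 network.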







Radstone Embedded Computing, Towcester, UK. 
+44 (0) 1327 359444. [www.radstone.com]. 

Multi-Interface PCI Serial I/O Board Is RoHS-
Compliant


A single-port PCI bus serial I/O adapter card makes field-selectable
connections to PLCs, bar code readers and other data collection
devices and is compliant with the European Union Restriction
of Hazardous Substances (RoHS) directive. The ULTRA 530.LPCI
multi-interface PCI serial I/O board from Sealevel Systems offers a
selectable RS-232/422/485/530 interface and is universal bus-compatible
(3.3V or 5V).


Compatibility with MD1 low-profile specifications makes the board 
especially useful for small form-factor applications, such as network ap- 
pliances, thin clients and 1U servers. Designed using the XR16C850 
UART, the ULTRA 530.LPCI supports standard PC data rates and 
boasts a top speed of 921.6 Kbits/s. The board provides a 128-byte FIFO 
for error-free data communications applications. UART options include 
a version that allows external clocking. Sealevel’s RS-485 auto-enable 
circuit automatically handles RS-485 driver control to facilitate com- 
patibility with standard COM drivers. 

The ULTRA 530.LPCI ships with Sealevel Systems’ SeaCOM 
suite of drivers for Windows 95/98/ME/NT/2000/XP. Also included is 
the WinSSD application for testing and diagnostics. Price is $229 in low 
volumes. A non-RoHS version is also available. 


Sealevel Systems, Liberty, SC. (864) 843-4343. [www.sealevel.com]. 





ATCA Test Extender Board Speeds Prototyping 


To provide full access for testing or debugging to a circuit card un- 
der test, extender boards must bring the card completely out of its card 
cage or enclosure. A new ATCA test extender board from Elma Bus- 
tronic extends both the power 
and Intelligent Platform Man- 
agement Bus (IPMB) signals. 


With a 10-layer stripline 
design, the ATCA extender 
board is designed for the fully 
populated fabric slot (5 ZD 
connectors, P20 through P24) 
and the power connector J10. 
The Zone 3 section is served 
by a blind board assembled to Zones 1 and 2 through the frame. The 
flexible design of the Zone 3 area allows customization for a minimum 
cost, since only the blind board must be changed to the required con- 
figuration. The complete keying system, including the Zone 3 area, is 
assembled. 







The ATCA extender board has a sturdy metal frame with latching 
handles. The injector/ejector handles provide a secure and reliable con- 
nection to the chassis. Pricing is under $1,000 in volume, depending on 
configuration. 


Elma Bustronic, Fremont, CA. (510) 490-7388. 
[www.elmabustronic.com]. 


Serial RapidIO Distributed Switch Solution for
VME Systems 


The dual Pentium 1 GHz 74587 VME PowerNode3 SBC from 
Thales Computers now interconnects through PMC-RIO serial RapidIO 
switch fabric PCI mezzanine cards. Thales has bundled the performance 
of PowerNode3 with a switch fabric solution to interconnect computing 
nodes all together inside a signal processing node. The Serial RapidIO 
technology reduces pin counts while staying full duplex and provides a 
low latency packet-based interconnect data push. Additionally, the tech- 
nology offers a very high degree of error management and provides a 
state-of-the-art architecture for reporting, and recovering from, trans- 
mission errors. The PMC-RIO mezzanine boasts its own distributed
switch that prevents any system single point of failure. The switch
fabric allows an aggregate throughput of up to 1.6 Gbytes/s
thanks to the 400 Mbyte/s sustained link bandwidth (peer
to peer), making the bundled card suitable for demanding signal
processing applications.







Thales ships the bundle (one PowerNode3 featuring dual 1 GHz
processors, 512 Mbytes of memory and 32 Mbytes of flash, along with the
RapidIO PCI mezzanine card) mounted and tested as a whole unit via LRU
integration tests, together with a complete Lynx 4.0 (and VxWorks
6.2 in further versions) software suite. Pricing for the bundled
PowerNode3 and PMC-RIO card starts at $10,200.


Thales Computers, Raleigh, NC. (919) 231-8000. [www.cetia.com]. 


Two Boards Offer Fully Packaged InfiniBand-based ATCA


An AdvancedTCA pair including a node blade and a switch blade 
provides an InfiniBand-based fabric for high-performance blade systems
utilizing the ATCA open architecture standard. The ATC5232 node 
board is a dual Intel Xeon-based PICMG 3.2-compliant processor board 
for wireless access/edge, telecom fiber transport, media gateways, soft 
switches and Internet IP-based applications. Onboard I/O peripherals 
are two auto-negotiating Gigabit Ethernet controllers for the base in- 
terface, two 10 Gbit/s x4 InfiniBand ports for the fabric interface, one 
64-bit/66 MHz PMC site for user configuration and other peripherals 
designed for high-performance Telco needs. Two 2.5 GHz x8 PCI Ex- 
press links are available at the RTM connectors. 


The ATS2148 Hub Board is a PICMG 3.0 and 3.2 Option 1 switch, which
provides separate control plane switching, data plane switching and
storage plane switching for ATCA shelves. It supports Gigabit Ethernet
on the base control network. The fabric features a 10 Gbit/s InfiniBand
switch with built-in InfiniBand Subnet Management Agent (SMA) and
Performance Management Agents (PMA). Multipathing and automatic path
migration are fully supported, enabling fault tolerance and failover as
well as providing notification that a self-healing fabric event occurred.
Single-unit pricing for the ATC5232 starts at $3,440
and for the ATS2148 at $4,629.


Diversified Technology, Ridgeland, MS. (800) 443-2667. 
[www.dtims.com]. 

Pentium M CompactPCI SBC for Harsh Industrial
Environments


A system slot or stand-alone board for CompactPCI systems in single
Eurocard format needs only one slot on the CompactPCI bus. The
32-bit/33 MHz F14 board from MEN Micro is built around
a Pentium M running at up to 2 GHz or, as an alternative, the
low-power Celeron M at up to 1 GHz. With a dedicated heat sink the
F14 can be used in the extended temperature range of -40° to +85°C.
To meet the requirements for shock and vibration the board has no
plugged components. In addition the card is prepared for coating
for use in humid and dusty environments.


The new 915GM chipset provides 
four PCI Express lanes for fast communication such as Giga- 
bit Ethernet or graphics, and two SATA interfaces. Standard I/O at the 
front panel includes VGA for graphics, two Gigabit Ethernet channels 
connected over PCI Express and two USB 2.0 ports. 


The F14 comprises up to 2 Gbyte fast DDR2 DRAM that is firmly 
soldered for shock and vibration-prone applications. A CompactFlash 
slot or a 1.8” hard disk can alternatively provide unlimited storage space. 
The F14 comes with board-support packages for Windows, Linux, Vx- 
Works and QNX. Single-unit pricing starts at $1,194. 


MEN Micro, Lago Vista, TX. (512) 267-8883. [www.menmicro.com]. 





VME SBC Targets Harsh Environments 


A highly reliable COTS single board computer for mission-critical
VME systems, the CPC600 from Fastwel is equipped with an Intel
Pentium M processor running at up to 2.1 GHz and supports up to
2 Gbytes of DDR SDRAM with ECC. The processor has up to 2 Mbytes
of on-die L2 cache running at CPU speed and a 400 MHz processor
system bus. All components including CPU, SDRAM and 32 Mbyte
solid-state disk may be soldered onboard, thus providing superior
shock/vibration resistance. Four Gigabit Ethernet ports and
conformance to the VITA 31 specification make the CPC600 an appropriate
platform for robust redundant systems. Live Insertion support, smart
temperature control, hardware monitor and watchdog timer position the
CPC600 for mission-critical applications.









A key element of the crash safety subsystem is 32K of non-volatile
RAM where user applications can keep critical data and the system log,
which must be preserved even if power fails. The CPC600 also has 64K of
EEPROM memory for user applications. The board can withstand high
shock and vibration with operating temperatures from -40° to +85°C.
Software support includes Microsoft DOS 6.22, Fastwel DOS 6.22,
Windows 2000/XP/CE, QNX and Linux. Single-unit pricing starts at
$2,987.


Fastwel, Moscow, Russia. +7 (095) 234-0639. [www.fastwel.com].

Industrial Computer Features Detachable
Display, Computer Unit 


Industrial computer systems for the ever-changing factory environ- 
ment must be reliable and easy to upgrade with a minimum of capital 
reinvestment. In response, Arista has released the ARP-1720AP series 
of industrial 20.1-in. LCD panel computers with an LCD monitor and 
modular computer unit that can both be detached for easy maintenance 
and quick upgrades. 


The ARP-1720AP’s display is a NEMA 4 Panel Mount 20.1-in.
LCD. The unit provides a variety of configurations ranging from a PIII
CPU to a powerful P4 CPU.


Up to 1 Gbyte of system memory is supported. Single or multiple
PCI and ISA slots are also available. The ARP-1720AP can be
configured with either a DC 24V or 100-230 VAC input power supply.
Multiple expansion slots and an optional RAID-1 are available for
certain configurations.

The ARP-1720AP supports either Windows 2000 Pro or XP Pro, and can run
virtually any control, data acquisition or SCADA software package.
Pricing for the series starts at $4,000.


Arista, Fremont, CA. (510) 266-1800. [www.aristaipc.com]. 





3U PXI/CompactPCI Digitizers Target Fast Test
Apps 

High-speed testing applications require fast data acquisition and
testing rates. With that in mind, Acqiris has introduced the 10-bit, 4
Gsample/s, 3U PXI/CompactPCI dual-channel DC152 and single-channel
DC122 digitizers, with input bandwidths of up to 3 GHz.

The single-slot digitizers incorporate Acqiris’ proprietary chipsets, 
the XLFidelity ADC front-end and the JetSpeed II A/D converter. The 
dual-channel DC152, with 2 GHz of bandwidth, provides synchronous 
sampling of 2 Gsamples/s on both input channels with up to 256 Mpoints 
of optional acquisition memory. The single-channel DC122 offers
sampling rates of up to 4 Gsamples/s with 512 kpoints of
standard, or 512 Mpoints of optional, acquisition memory. DC122
options include standard or high-frequency front-ends.
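
The sampling rates and memory depths quoted above determine how long a signal each digitizer can capture in one acquisition: points divided by sample rate. A rough Python sketch of that arithmetic follows. It assumes decimal prefixes (1 kpoint = 1,000 points), which the brief does not state, and the function name is illustrative rather than part of any Acqiris driver.

```python
KPOINTS = 1_000        # assumption: decimal prefix, not 1,024
MPOINTS = 1_000_000    # assumption: decimal prefix


def capture_window_seconds(points, sample_rate_hz):
    """Time span covered by one acquisition: points / sample rate."""
    if sample_rate_hz <= 0:
        raise ValueError("sample rate must be positive")
    return points / sample_rate_hz


if __name__ == "__main__":
    cases = [
        ("DC122, standard memory", 512 * KPOINTS, 4e9),
        ("DC122, optional memory", 512 * MPOINTS, 4e9),
        ("DC152, optional memory", 256 * MPOINTS, 2e9),
    ]
    for label, points, rate in cases:
        window = capture_window_seconds(points, rate)
        print(f"{label}: {window * 1e6:.1f} microseconds")
```

Under these assumptions the standard DC122 memory covers a window on the order of a hundred microseconds, while the optional memory stretches that to a sizable fraction of a second.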





The 50 ohm input stage of the DC122 standard front-end is fully
protected against overvoltage signals. The XLFidelity provides input
voltage ranges from 50 mV to 5V full scale (in a 1, 2, 5 sequence) with
variable voltage offset. The DC122 high-frequency input front-end gives
direct access to the XLFidelity crosspoint switch. The full-scale range
is fixed at 1V, providing a bandwidth of 3 GHz. The input channel has
overvoltage protection to +3V. The DC152 and DC122 are supported with
AcqirisLive and AcqirisMAQS software and Windows, Linux and VxWorks
drivers. Pricing begins at $16,480.


Acqiris, Monroe, NY. (877) 227-4747. [www.acqiris.com].

Software Radio Development Platform with SCA
Compliance 


A development platform containing all hardware and software 
tools required for developing software-defined radio is compliant with 
the Software Communications
Architecture (SCA) mandated 
for all future U.S. military ra- 
dios. The Pentek SCA 2510 
hardware platform consists of 
a Pentek 7640 software radio 
transceiver PCI card installed in 
a PC workstation. The computer 
is loaded with the Linux operat- 
ing system, a set of development 
tools and the SCARI++ SCA core 
framework from Communications Research Centre (CRC) Canada. The 
hardware and software are fully integrated and the PCI card comes pre- 
configured with drivers and libraries. 


The Model 7640 Dual Channel Transceiver PCI board digitizes 
HF or IF input signals using a pair of 14-bit, 105 MHz A/D converters 
and generates output signals with two 16-bit, 500 MHz D/A converters. 
The 7640 is also equipped with a Virtex-II Pro VP50 FPGA that serves
as a control and status engine with data and programming interfaces to 
each of the many onboard resources, including a four-channel digital 
down-converter, a digital up-converter and a clocking and synchroni- 
zation system. The software development environment is based on the 
SCARI++ SCA Core Framework, Component Development Library 
and Software Defined Radio (SDR) Development Toolset. 


Single-seat pricing starts at $89,995. Discounts are available for 
additional seats. 


Pentek, Upper Saddle River, NJ. (201) 818-5900. [www.pentek.com]. 


Motor Control Developers’ Kit Delivers High 
Performance 


Developers of low-cost, high-end motion control applications need
solutions for building high-performance, stand-alone motor controllers,
amplifiers or intelligent drives. With that in mind, Performance
Motion Devices’ DK73110 Developers’ Kit for the company’s
MC73110 Brushless Motor Control IC has high-efficiency onboard
MOSFET amplifiers that boost speed and performance.


PMD’s DK73110 is a complete, 
integrated intelligent amplifier that includes a MC73110 
IC. It incorporates a velocity loop, a current loop, commutation, high- 
performance half-bridge MOSFET switchers and integrated motor con- 
nections. A proportional integral current control algorithm drives the 
power stage. The kit can also be used to develop a custom amplifier 
using external high-power switching circuits. The DK73110 drives a 
three-phase brushless motor at up to 10 amps, inputs analog or digital 
command signals and requires a single-voltage high-power input. 
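
For readers unfamiliar with proportional integral (PI) current control, here is a minimal generic sketch in Python. It is not PMD's algorithm or the C-Motion API. The gains, sample period, voltage limits and the simple resistor-inductor winding model are all hypothetical values chosen only to show the technique.

```python
class PIController:
    """Discrete PI controller with output clamping and simple anti-windup."""

    def __init__(self, kp, ki, dt, out_min, out_max):
        self.kp = kp
        self.ki = ki
        self.dt = dt
        self.out_min = out_min
        self.out_max = out_max
        self.integral = 0.0

    def update(self, setpoint, measured):
        error = setpoint - measured
        candidate = self.integral + error * self.dt
        output = self.kp * error + self.ki * candidate
        # When the output saturates, return the limit and leave the
        # integral unchanged so it cannot wind up.
        if output > self.out_max:
            return self.out_max
        if output < self.out_min:
            return self.out_min
        self.integral = candidate
        return output


def simulate(target_amps=5.0, steps=2000):
    """Drive a hypothetical R-L winding toward target_amps; return final current."""
    resistance = 1.0     # ohms (assumed)
    inductance = 1e-3    # henries (assumed)
    dt = 50e-6           # seconds per control step (assumed)
    controller = PIController(kp=2.0, ki=2000.0, dt=dt,
                              out_min=-24.0, out_max=24.0)
    current = 0.0
    for _ in range(steps):
        voltage = controller.update(target_amps, current)
        # Forward-Euler step of L * di/dt = v - R * i
        current += dt * (voltage - resistance * current) / inductance
    return current


if __name__ == "__main__":
    print(f"final current: {simulate():.3f} A")
```

The proportional term reacts to the present error, while the integral term removes the steady-state offset that the winding resistance would otherwise leave.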

The DK73110 comes with PMD’s C-Motion API, which can be used 
to write applications with standard C and C++ language commands, and 
PMD’s Pro-Motion GUI, a Windows-based program for exercising the
motor hardware. Pricing for the DK73110 is $495. 

Performance Motion Devices, Lincoln, MA. (781) 674-9860. 
[www.pmdcorp.com]. 











Lab Kit Simplifies Development Time for 
Temperature Sensing Apps 


A development lab kit for temperature sensors makes develop- 
ment of temperature sensing applications fast and efficient by allowing 
simultaneous display and evaluation of four signals: data, minimum, 
maximum and average. The TSic LABkit from ZMD America enables 
designers to evaluate up to four of its TSic sensors at one time. Data can 
also be recorded in a text file that can be imported by other applications, 
such as Microsoft Excel. The kit includes four TSic 306 e-line sensor
ICs with an accuracy of +/- 0.3°C with a 1-meter cable; a TSic LABkit
USB Adapter for up to four temperature sensors, including a USB cable
and a recorder for data acquisition, display and recording
software for PC/Windows.
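
The four displayed signals (data, minimum, maximum and average) can be tracked with a small running-statistics structure. The Python sketch below illustrates the idea only. It does not use the LABkit software or any TSic interface, and the readings and tab-separated row layout are made-up assumptions.

```python
from dataclasses import dataclass


@dataclass
class ChannelStats:
    """Running latest/min/max/average for one temperature channel."""
    count: int = 0
    total: float = 0.0
    latest: float = float("nan")
    minimum: float = float("inf")
    maximum: float = float("-inf")

    def add(self, value):
        self.count += 1
        self.total += value
        self.latest = value
        self.minimum = min(self.minimum, value)
        self.maximum = max(self.maximum, value)

    @property
    def average(self):
        return self.total / self.count if self.count else float("nan")

    def as_row(self, name):
        """Tab-separated line that a spreadsheet can import as text."""
        return (f"{name}\t{self.latest:.2f}\t{self.minimum:.2f}"
                f"\t{self.maximum:.2f}\t{self.average:.2f}")


if __name__ == "__main__":
    # Hypothetical readings in degrees Celsius for two of the four channels.
    readings = {"sensor1": [21.5, 22.0, 20.5], "sensor2": [30.0, 30.5]}
    for name, values in readings.items():
        stats = ChannelStats()
        for value in values:
            stats.add(value)
        print(stats.as_row(name))
```

Keeping a running total and count avoids storing every sample, which matters when a recording session runs for hours.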


The TSic family of digital temperature sensors is tested and calibrated
by IST AG to provide absolute accuracy when delivered to customers.
For example, the TSic 506F features a resolution of
0.034°C, while the TSic 106, TSic 206 and TSic
306 feature a resolution of 0.1°C.


Designed as a high-performance, cost-effective solu- 
tion, the TSic also offers low power and fast response time. The devices 
are ideal for temperature sensing in automotive applications, industrial 
and process control equipment, information technology products such 
as PCs, hard disk drives, consumer products, medical instrumentation 
and white goods. The TSic LABkit is priced at $185. 


ZMD America, Melville, NY. (631) 549-2666. [www.zmd.biz]. 


Digital Motor Control Development Kit Supports 
MATLAB 


To improve R&D and reduce development time for digital motor 
control applications, Technosoft has introduced the MCK2812 Kit C Pro- 
MS(BL) motor control kit, which supports MATLAB’s automatic C code 
generation. The kit comprises a complete motor control development 
platform, including necessary hardware (motor, sensors, power inverter) 
and development software, such as the MATLAB system model and 
complete DSP source code for a brushless motor control application. 


The MCK2812 Kit is based on an SK2812 board with a 150 MHz 
TMS320F2812, 128 Kw external RAM, 2 x 12-bit D/A outputs and RS- 
232, CAN-bus and JTAG interfaces. Hardware components include a 
PM50 3-phase inverter power module, a brushless motor equipped with 
Hall sensors and a 500-line encoder, and a real-time serial 
communication monitor. Software includes PROCEV28x processor 
evaluation software with ASM/C source code, DMCD28x-Pro Digital Motion 
Control Developer software with reference and trace functions, 
brushless motor control demos for sinusoidal mode and a DMCode-MS(BL) 
Source Code library for position and speed control, including a 
MATLAB-Simulink model of the complete motor control structure. 


The kit comes with TI’s C-compiler, assembler and linker tools, as 
well as user and reference manuals for the kit and the TMS320F2812 
DSP controller. Price is $4,995. 

Technosoft, Bevaix, Switzerland. +41 32 732 55 00. 
[www.technosoftmotion.com]. 


RS232 Module Enables Data Transmission up to 
1.2 km 


Data transmission via serial interfaces is both a simple and safe 
method to connect computers with each 
other or with peripheral components. SMA 
Computers has developed a range of plug- 
on boards, called piggybacks, which are 
able to adjust the serial interfaces on SMA 
modules according to the various physi- 
cal standards. Now available is a special 
RS232 piggyback allowing a point-to-point 
connection over a distance of 1.2 km. 





Serial data transmission according 
to the RS232 standard represents the simplest way of exchanging data 
between two participants, but the cable length is limited to 
approximately 20 meters. The 422S4PB-GI1 piggyback enables connections 
over up to 60 times that distance. 


The technology can be used with all SMA modules that have serial 
interfaces. The piggyback converts the interface signals TxD, RxD, 
RTS and CTS of the serial interface module into differential RS422 
levels and electrically isolates them. The receive lines of the 
RS422 interface are terminated with 120 ohms on the piggyback and 
equipped with 680-ohm bias resistors in order to permanently apply 
a valid voltage level (> +200 mV) to the receiving module. The RS422 
signal levels make it possible to transmit data over a distance of up to 
1.2 km when using suitable cables. A piggyback must be installed on 
each component. The 422S4PB-GI1 is priced at $78. 
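
The effect of the bias resistors can be checked with a simple voltage divider. The sketch below assumes one 680-ohm pull-up, the 120-ohm termination and one 680-ohm pull-down in series across the supply, with the line undriven. The supply voltages and the function name are assumptions for illustration; the article does not state the supply rail or the exact bias network.

```python
def idle_differential_voltage(v_supply, r_bias, r_term):
    """Voltage across the termination resistor of an undriven pair.

    Models pull-up -> termination -> pull-down as a series divider.
    """
    return v_supply * r_term / (2 * r_bias + r_term)


R_BIAS = 680.0   # ohms, from the text
R_TERM = 120.0   # ohms, from the text
THRESHOLD = 0.2  # volts, the "> +200 mV" valid level quoted in the text

for v_supply in (3.3, 5.0):  # assumed supply rails, not stated in the text
    v_idle = idle_differential_voltage(v_supply, R_BIAS, R_TERM)
    print(f"{v_supply} V supply: {v_idle * 1000:.0f} mV idle, "
          f"valid={v_idle > THRESHOLD}")
```

Under these assumptions both rails keep the idle line above the 200 mV threshold, which is the purpose the text gives for the bias resistors.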


SMA Computers, Fountain Valley, CA. (714) 593-2338. 
[www.SMAcomputers.com]. 


CompactPCI Express Backplanes Target Video 
Graphics Apps 

A new generation of video graphics applications requires higher 
performance, especially in backplanes. The four-slot CompactPCI Express 
EXPO backplane from Elma Bustronic has a 10-layer stripline design and 
contains a system slot, one Type 1 slot and two Type 2 slots. 


Based on the new PICMG speci- 
fication, the EXPO backplane is back- 
ward compatible to CompactPCI and 
also supports next-generation PCI Ex- 
press architecture in the familiar 6U-160 Eurocard form-factor. Cards 
are connected via a serial point-to-point bus with a read-only bandwidth 
of up to 2.5 Gbits/s (16x) or 2.5 Gbits/s full duplex (8x). Support for 
several different card form-factors is provided, with connectivity in 1x, 
2x, 4x and 8x increments. Each link is 2.5 Gbits/s full duplex. Support 
of legacy 32- or 64-bit CompactPCI boards is accomplished by a PCIe- 
to-PCI bridge. Because the CompactPCI Express architecture supports 
the P3, P4 and P5 connectors in all 6U slot types, it can continue to sup- 
port all existing CompactPCI secondary architectures, such as PICMG 
2.5, 2.20, 2.16, 2.17 and 2.18, either as functions on native cPCI Express 
cards or as legacy cards in the original cPCI form. 
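
As a rough guide to what the link widths mean in practice, the sketch below scales the 2.5 Gbits/s per-link figure by the number of lanes. It assumes that figure is the raw signalling rate per lane and per direction, and it applies the 8b/10b line coding used by first-generation PCI Express. The constant and function names are illustrative only.

```python
LANE_RATE_MBITS = 2500  # raw rate per lane and direction: the 2.5 Gbits/s in the text


def payload_mbytes_per_s(lanes):
    """Usable megabytes per second in one direction for a link of `lanes` lanes."""
    payload_mbits = lanes * LANE_RATE_MBITS * 8 // 10  # 8b/10b line coding overhead
    return payload_mbits // 8                          # bits to bytes


for lanes in (1, 2, 4, 8):  # link widths named in the text
    print(f"{lanes}x: {payload_mbytes_per_s(lanes)} MB/s per direction")
```

Because each link is full duplex, the same payload rate is available in both directions at once.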


Pricing for the 4-slot EXPO backplane is under $400, depending on 
volume and configuration. 


Elma Bustronic, Fremont, CA. (510) 490-7388. 
[www.elmabustronic.com]. 


C265 “CONDOR” 
High Performance, Upgradeable Pentium® M Processor 


* Up to 2GHz Pentium® M processor with 2MB of L2 Cache 
* Field upgradeable CPU module to deploy latest Pentium® M 
  including the new Dual Core processors 
* Ultra-low power requirements as low as 12W Max 
* Highest performance per watt processor module available 
* Up to 2GB of 266MHz DDR SDRAM via SODIMM module 
* Dual Gigabit Ethernet ports to front panel, rear or PICMG 2.16 PSB 
* Dual Video to support two DVI monitors or DVI & RGB together 
* 2D/3D video acceleration with OpenGL® and Direct-X® support 
* One PCI-X compliant PMC site with rear I/O 
* Onboard support for one IDE HDD or Compact Flash 
* One SAM™ I/O expansion module for Audio, 1394 or custom I/O 
* Four USB-2.0 ports and two serial ports 
* 1600 Gate user configurable CPLD, with 32 GPIO lines 
* Up to 512KB of BIOS/user Flash 
* RTC with field-replaceable battery 
* CPU temperature and voltage monitoring for safe operation 
* Fully Hot Swappable 64-bit/66MHz CPCI bus 
* Available in standard 0°-55°C or extended temp. -40° to 85°C 
* Support for Windows® XP/2000, VxWorks® and Linux® 


Also available in VME 


ENGINEERED LIKE NO OTHER 


FROM CONCEPTS TO DEPLOYMENT 


S620 "HAWK" 
Mini-ITX with Dual PMC Pentium® M System 


* Ultra small footprint system, Only 2" x 7" x 10" and 4.5 lbs. 
* Up to 2GHz+ Intel Pentium® M processor with up to 2MB of L2 Cache 
* Field upgradeable CPU module to deploy latest Pentium® M processors 
* Ultra-low power requirements as low as 10W max 
* Up to 2GB of 266MHz DDR SDRAM 
* Dual Gigabit Ethernet with TCP/IP Offloading Engine 
* High performance dual video display (RGB and DVI) with 64MB RAM 
* Dual Video to support two DVI monitors or DVI & RGB together 
* 2D/3D video acceleration with OpenGL® and Direct-X® support 
* Two PCI-Mezzanine-Carrier (PMC) sites for high-speed I/O expansion 
* Dual channel Line-In/Out/Mic with CD audio for full Multi-Media support 
* Dual 1394a and quad USB-2.0 ports via front/rear I/O panels 
* Ultra quiet fan for Desktop applications 
* Full power management control: 
  - ACPI 1.0 compliant (suspend to memory/disk etc.) 
  - Geyserville® III support (speed and voltage stepping) 
  - Support for battery operation and standby power 
* CPU, system temperature and voltage monitoring for safe operation 
* Up to 100GB ATA HDD and CD-RW/DVD±RW Drive 
* Support for Windows® XP/2000, VxWorks® and Linux® 


LEADING THE EMBEDDED MARKET SINCE 1979. 





COMPUTING 




PERFORMANCE, RELIABILITY, LONGEVITY 


GENERAL MICRO SYSTEMS, INC. 


TEL (800) 307-4863 * gms4sbc.com 





Performance Beyond Limits 
ETXexpress products are next generation 
embedded modules based on the PICMG 
COM Express standard. ETXexpress 
provides the highest performance and 
I/O bandwidth available in COMs. 


» PCI Express - the elemental data path 
» Gigabit Ethernet - for high connectivity 
» USB 2.0 - for fast periphery 
» Serial ATA - for fast drives 
» ACPI - for optimized power management 

ETXexpress-PM 
» Highest performance state of the art embedded module 
» Intel® Pentium® M processor and advanced Intel® chipset 





Get ready. Get ETXexpress 
Visit www.kontron.com/ETXexpress 